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AGAATTAGTT GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC 150 

CAGGCGATGA CACTCCAGTT GTTCGCGGTT CAGCACTGAA AGCGCTGGAA 2 00 

GGCAACCCAG AGTGGGAAGC AAAAATTGTT GAATTAGCAG GTTACCTGGA 250 

TTCTTACATC CCAGAACCAG AGCGTGCAAT TGACAAGCCA TTCCTGCTGC 3 00 

5 CAATCGAAGA CGTATTCTCA ATCTCTGGTC GTGGTACAGT AGTAACAGGC 3 50 

CGTGTTGAGC GTGGTATCAT CAAAGTTGGT GAAGAAGTTG AAATCGTTGG 4 00 

TATTCAAGCG ACTGCGAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGTA 4 50 

AACTGCTGGA TGAAGGTCGT GCGGGTGAGA ACGTTGGTGT TCTGCTGCGT 500 

GGTACTAAAC GTGAAGAAAT TCAACGTGGT CAAGTACTGG CTAAACCAGG 550 

10 TTCAATCAAG CCACACACTC AATTCGAATC AGAAGTATAT ATTCTGAGCA 600 

AAGATGAAGG TGGTCGTCAT ACTCCATTCT TCAAAGGCTA CCGTCCACAG 650 

TTCTACTTCC GTACAACTGA CGTAACCGGT ACTATCGAAC TGC CAGAAGG 700 

CGTAGAGATG GTAATGCCAG GCGACAACAT CAACATGATC GTGAC 745 

15 



2) INFORMATION FOR SEQ ID NO: 150 

(i) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 

(B) STRAIN: ATCC 9250 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150 



CGGTGCAATC CTGGTTGTTG CTGCGACTGA TGGCCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGTTAGGY CGCCAAGTAG GTGTWCCTTA CATCATCGTT 100 

35 TTCCTGAACA AATGTGACAT GGTAGACGAC GAAGAACTGT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA ATACGATTTC CCAGGCGACG 200 

ACACTCCAGT TGTCCGTGGT TCAGCTCTGA AAGCGCTGGA AGGCAACCCA 250 

GAGTGGGAAG CGAAAATTGT TGAATTAGCA GGTCACTTGG ATTCTTACAT 3 00 

CCCAGAACCA GAGCGTGCAA TTGACAAACC ATTCCTGCTG CCAATCGAAG 3 50 

40 ACGTATTCTC AATCTCTGGT CGTGGTACAG TAGTAACAGG CCGTGTTGAG 4 00 

CGTGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCCAAGA 450 

CACGGTTAAA ACAACTTGTA CTGGCGTTGA~ AATGTTCC^ ~ 50V ~ 

ACGAAGGTCG TGCGGGTGAG AACGTTGGTG TTCTGCTGCG TGGTACTAAA 550 

CGTGAAGAAA TTCAACGTGG TCAAGTACTG GCAAAAC CAG GTTCAATCAA 600 

45 GCCACACACT AAATTCGAAT CAGAAGTCTA TATTCTGAGC AAAGATGAAG 650 

GTGGTCGTCA CACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACAGG TACTAT CGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGTGATAACA TCAACATGAT CGTTACCCTG ATCCACCCAA 8 00 

TCGCGATGGA CGACGGTTTA CGTTTCGCAA 830 



50 



2) INFORMATION FOR SEQ ID NO: 151 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rustigianii 

(B) STRAIN: ATCC 33673 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 



CGGTGCAATC 

10 GTGAGCACAT 
TTCCTGAACA 
TGAAATGGAA 
ACACTCCAGT 
GAGTGGGAAG 

15 CCCAGAACCA 
ACGTATTCTC 
CGTGGTATCG 
CACRGTTAAA 
ACGAAGGTCG 

2 0 CGTGAAGAAA 
GCCACACACT 
GTGGTCGTCA 
CGTACAACTG 
GGTAATGCCA 

25 TCGCGATGGA 



CTGGTTGTTG 
CCTGTTAGGT 
AATGTGACAT 
GTTCGTGAAC 
TGTTCGYGGT 
CGAAAATTGT 
GAGCGCGCAA 
AATCTCTGGT 
TTAAAGTTGG 
ACAACTTGTA 
TGCTGGTGAG 
TTCAACGTGG 
ACTTTTGAAT 
TACTCCATTC 
ACGTAACCGG 
GGCGACAACA 
TGATGGTTTA 



CTGCGACTGA 
CGCCAAGTAG 
GGTTGACGAC 
TTCTGTCTCA 
TCAGCACTGA 
TGAATTAGCC 
TTGACCGTCC 
CGTGGTACAG 
TGAAGAAGTT 
CTGGCGTTGA 
AACGTTGGTG 
TCAAGTACTG 
CAGAAGTTTA 
TTCAAAGGCT 
TACTATCGAA 
TCAACATGAT 
CGTTTC 



TGGC CCAATG 
GTGTTCCTTA 
GAAGAACTGT 
GTACGATTTC 
AAGCGCTGGA 
GGTTACCTGG 
ATTCCTGCTG 
TAGTAACAGG 
GAAATCGTTG 
AATGTTCCGT 
TTTTACTGCG 
GCTAAACCAG 
TATTCTGAGC 
ACCGTCCACA 
CTGCCAGAAG 
CGTGACACTG 



CCACAAACTC 50 

CATCATCGTT 100 

TAGAATTAGT 150 

CCAGGCGACG 2 00 

AGGTATCCCT 2 50 

ACAGCTACAT 3 00 

CCAATCGAAG 3 50 

SCGTGTTGAG 4 00 

GTATC CAAGA 450 

AAACTGCTTG 500 

TGGTACTAAG 550 

GTTCAATCAA 600 

AAAGATGAAG 650 

GTTCTACTTC 700 

GCGTAGAGAT 750 

ATTCACCCAA 800 
826 
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35 



40 



2) INFORMATION FOR SEQ ID NO: 152 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia. stuartii 

(B) STRAIN: ATCC 33672 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152 



45 CGGTGCAATC 
GTGAGCACAT 
TTCCTGAACA 
TGAAATGGAA 
ACACTCCAGT 

50 GAGTGGGAAG 
CCCAGAGCCA 
ACGTATTCTC 
CGTGGTATCA 
GACTGCGAAA 

55 ACGAAGGCCG 
CGTGAAGAAA 
GCCACACACA 
GTGGTCGTCA 
CGTACAACTG 

60 GGTAATGCCA 



CTAGTTGTTG 
CCTGTTAGGT 
AATGTGACAT 
GTTCGTGAAC 
TAT CCGTGGT 
CGAAAATCGT 
GAGCGTGCAA 
AATCTCAGGT 
TCAAAGTTGG 
ACCACTTGTA 
TGCGGGTGAG 
TCGAACGTGG 
ACTTTCGAAT 
CACGCCATTC 
ACGTAACAGG 
GGCGACAACG 



CGGCAACAGA 
CGTCAGGTTG 
GGTAGACGAC 
TTCTGTCTCA 
TCAGCGCTGA 
TGAACTAGCA 
TTGACAAGCC 
CGTGGTACAG 
TGAAGAAGTT 
CTGGCGTTGA 
AACGTAGGTG 
TCAAGTTCTG 
CAGAAGTTTA 
TTCAAAGGYT 
TACTATCGAA 
TGAACATGAA 



TGGCCCAATG 
GCGTTCCTTA 
GAAGAGCTGC 
ATACGATTTC 
AAGCGTTGGA 
GAAGCACTGG 
ATTCCTGCTG 
TAGTCACAGG 
GAAATCGTAG 
AATGTTCCGT 
TTCTGCTGCG 
GCGAAACCAG 
TATTCTGAGC 
ACCGTCCACA 
CTGCCAGAAG 
AGTAACTCTG 



CCACAAACTC 50 

CATCATCGTG 100 

TGGAACTGGT 150 

CCAGGTGATG 2 00 

AGGCAACCCA 250 

ACAGCTACAT 300 

CCAATCGAAG 350 

CCGTGTTGAG 4 00 

GTATCAAAGA 450 

AAACTGCTGG 500 

TGGTACTAAG 550 

GTTCAATCAA 600 

AAAGATGAAG 650 

GTTCTACTTC 700 

GCGTAGAGAT 750 

ATTCACCCAA 800 
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TCGCGATGGA CGATGGTTTG CGTTTCGCAA 



5 2) INFORMATION FOR SEQ ID NO: 153 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 7 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: ATCC 3 5554 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 153 



CCTGGTTTGC TCGGCTGCCG ACGGCCCCAT GCCGCAGACC CGCGAGCACA 50 

TCCTGCTGTC CCGCCAGGTA GGCGTTCCCT ACATCGTCGT GTTCCTGAAC 100 

AAAGCCGACA TGGTCGACGA CGCCGAGCTG CTGGAACTGG TCGAGATGGA 150 

AGTTCGCGAT CTGCTGAACA CCTACGACTT CCCGGGCGAC GACACTCCGA 2 00 

25 TCATCATCGG TTCCGCGCTG ATGGCGCTGG AAGGCAAGGA TGACAACGGC 250 

ATCGGCGTAA GCGCCGTGCA GAAGCTGGTA GAGACC CTGG ACTCCTACAT 300 

TCCGGAGCCG GTTCGTGCCA TCGACCAGCC GTTCCTCATG CCGATCGAAG . 350 

ACGTGTTCTC GATCTCCGGT CGCGGTACCG TGGTAACCGG TCGTGTAGAG 4 00 

CGCGGCATCA TCAAGGTCCA GGAAGAAGTG GAAATCGTCG GCATCAAGGC 450 

30 GACCACCAAG ACCACCTGCA CCGGCGTTGA AATGTTCCGC AAGCTGCTCG 500 

ACGAAGGTCG TGCTGGTGAG AACGTTGGTA TCCTGCTGCG TGGCACCAAG 550 

CGTGAAGACG TAGAGCGTGG CCAGGTACTG GCCAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAGT GCGAAGTGTA CGTGCTGTCC AAGGAAGAAG 650 

GTGGTCGTCA CACCCCGTTC TTCAAGGGCT ACCGTCCGCA GTTCTACTTC 70 0 

35 CGTACCAC KG ACGTGACCGG TAMCTGCGAG CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAGATGGT TGTCACCCTG ATCGCTCCGA 80 0 

TCGCCATGGA AGATGGCTGC GTTCGCG 82 7 



40 



50 



2) INFORMATION FOR SEQ ID NO: 154 



< i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 841 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas fluorescens 

(B) STRAIN: ATCC 13525 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154 

CGGCGCAATC CTGGTTTGCT CGGCCGCTGA TGGTCCGATG CCACAAACCC 50 

GTGAACACAT CCTGCTGTCC CGTCAGGTTG GCGTTCCGTA CATCGTGGTT 10 0 

TACCTGAACA AGGCTGACCT GGTAGACGAC GCTGAGCTGC TGGAACTGGT 150 

6 0 TGAGATGGAA GTGCGCGATC TGCTGAGCAC TTACGACTTC CCAGGCGACG 2 00 
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ACACTCCGAT CATCATCGGT TCTGCTCGTA 
GACAACGAAA TGGGCACCAC GTCCGTTCGT 
CAGCTACATC CCAGATCCAG TTCGTGTTAT 
CAATCGAAGA CGTGTTCTCG ATCTCCGGTC 
CGTATCGAGC GCGGTATCGT TAAGGTTCAA 
TCTGCGTGAC ACTACCGTCA CCACCTGCAC 
AGCTGCTCGA CGAAGGTCGT GCTGGCGAGA 
GGTACCAAGC GTGACGACGT TGAGCGTGGC 
TTCGGTTAAG CCGCACACCA AGTTCGAAGC 
AAGAAGAAGG CGGTCGTCAC ACTCCGTTCT 
TTCTACTTCC GTACTACTGA CGTGACTGGT 
CGTTGAAATG GTTATGCCAG GCGACAACAT 
TCAAAACCAT CGCAATGGAA GACGGTCTGC 



TGGCTCTGGA AGGCAAAGAC 250 

AAACTGGTTG AAACTCTGGA 3 00 

CGACAAGCCG TTCCTGATGC 350 

GCGGTACTGT TGTGACTGGT 4 00 

GATCCACTGG AAATCGTTGG 4 50 

CGGTGTTGAA ATGTTCCGTA 500 

ACTGCGGCGT TCTGCTGCGT 550 

CAGGTTCTGG TTAAGCCAGG 600 

TGAAGTCTAC GTACTGAGCA 650 

TCAAAGGCTA CCGTCCACAG 700 

AACTGCGAGC TGCCGGAAGG 750 

CAAAATGGTT GTTACCCTGA 800 

GTTTCGCTAT T 841 



2) INFORMATION FOR SEQ ID NO: 155 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas stutzeri 

(B) STRAIN: ATCC 17588 

0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155 

CGGCGCGATC CTGGTCTGCT CGGCTGCTGA CGGCCCCATG CCGCAGACTC 50 

GCGAGCACAT CCTGCTGTCC CGTCAGGTTG GTGTTCCGTA CATCGTCGTG 100 

TTCCTGAACA AGGCCGACAT GGTTGATGAC GCCGAGCTGC TCGAGCTGGT 150 

5 CGAGATGGAA GTTCGYGACC TGCTGTCGAC CTACGACTTC CCGGGTGAYG 200 

ACACTCCGAT CATCATCGGC TCCGCGCTGA TGGCGC TGAA CGGCGAAGAC 2 50 

GACAACGAGC TCGGCACCAC TGCGGTGAAG AAGCTGGTCG AGACCCTGGA 3 00 

CAGCTACATT CCCGAGCCGG TTCGTGCCAT CGACAAGCCG TTCCTGATGC 3 50 

CGATCGAAGA CGTGTTCTCG ATCTCCGGTC GCGGCACSGT GGTAACCGGT 4 00 

0 CGCGTAGAGC GCGGCATCGT CAAGGTT C AG GAAGAGATCG AGATCGTCGG 4 50 

TCTGCGTCCG ACCACCAAGA CTACCTGCAC CGGCGTTGAG ATGTTCCGCA 500 

AGCTGCTCG/T ~Y GAKciG T i^G 7 i r X*CTC^C^AGA~ ACTCJCciu Y CiT CJClTiC'lIiCUi b b U 

GGCACCAAGC GTGACGAAGT GGAGCGTGGT CAGGTTCTGG CCAAGCCGGG 600 

CACCAT CAAG CCGCACACCA AGTTCGAAGC CGAAGTGTAC GTGCTGTCCA 650 

AGGAAGAAGG TGGTCGTCAC ACCCCGTTCT TYAAGGGCTA CCGTCCKCAG 7 00 

TTCTACTTCC GTACCACTGA YGTGACYGGW TCGTGCGARC TGCCGGAAGG 7 50 

CGTCGAGATG GTAATGCCGG GCGACAACGT GAAGATGGTT GTCACCCTGA 8 00 

TCAAGC CGAT CGCCATGGAA GACGGCCTGC GCTTCGCGAT T 841 



2) INFORMATION FOR SEQ ID NO: 156 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Psychrobacter phenylpyruvicus 

(B) STRAIN: ATCC 23333 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156 

GCTATTCTAG TAGTAT CAGC AACTGACGGT CCAATGCCAC AAACACGTGA 50 

GCACATTCTA TTATCACGTC AGGTTGGTGT ACCATACATC ATCGTATTCA 100 

10 TGAACAAATG TGACATGGTA GATGACGAAG AGTTACTAGA GCTAGTAGAA 150 

ATGGAAGTGC GTGAATTACT TTCAGACTAC GACTTCCCAG GTGATGACAC 200 

TCCAATCATC AAAGGTTCAG CTTTAGAAGC GTTAAATGGT AACGACGGTA 250 

AGTACGGTGA GCCAGCAGTT ATCGAACTAC TAAACACTCT AGACACTTAC 300 

ATTCCAGAGC CAGAGCGTGA CATCGATAAG CCATTCCTAA TGCCAATCGA 350 

15 AGACGTATTC TCAATC TCAG GTCGTGGTAC AGTAGTAACA GGCCGTGTTG 400 

AATCTGGTAT CATCAAAGTT GGTGACGAAA TCGAAATCGT TGGTATCAAA 450 

GACACAGTTA AAACAACTTG TACTGGTATC GAGATGTTCC GTAAGTTACT 500 

AGACGAAGGT CGTGCTGGTG AGAACTGTGG TGTACTATTA CGTGGTACTA 550 

AGCGTGAAGA CGTACAACGT GGTCAAGTAC TTGCTAAGCC AGGTTCAATC 600 

20 ACTCCACACA CCAACTTCGA CGCAGAAGTA TACGTACTAT CAAAAGAAGA 650 

AGGTGGTCGT CACACTCCAT TCTTAAATGG TTACCGTCCA CAGTTCTACT 7 00 

TCCGTACTAC TGACGTAACA GGTGCAATCA CGTTACAAGA AGGTACTGAA 750 

ATGGTAATGC CAGGCGATAA CGTTGAGATG AGCGTAGAGC TAATCCACCC 800 

AATCGCTAGG ACAAAGGTTT ACGTTTCGCA ATC 833 

25 



2) INFORMATION FOR SEQ ID NO: 157 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rahnella aquatilis 
40 (B) STRAIN: ATCC 33071 

GGCGCTATCC TGGTTGTTGC TGCAACTGAC GGCCCTATGC CTCAGACTCG 50 

4 5 TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCATAC ATGATCGTGT 100 

TCATGAACAA ATGCGACATG GTAGATGACG AAGAGC TGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAACT TCTGTCTGCT TACGAATTCC CAGGCGACGA 2 00 

CATCCCGGTC ATCAAAGGTT CAGCGCTGAA AGCACTGGAA GGCGATGCTA 250 

CTTGGGAAGC GAAAATCATC GAACTGGCAG AAGCACTGGA CAGCTACATT 3 00 

50 CCATTGCCAG AGCGTGCTAT CGATAAGCCA TTCCTGCTGC CAATCGAAGA 350 

CGTATTCTCC ATCTCCGGTC GTGGTACAGT GGTTACCGGT CGTGTAGAGC 4 00 

GCGGTATCGT TAAAGTGGGC GAAGAAGTTG AAATCGTCGG TAT CAAGG AC 4 50 

ACTGTTAAGT CTACTTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCGGGCGAGA ACGTGGGTGT TCTGCTGCGT GGTATCAAGC 550 

55 GTGAAGACAT CGAACGTGGT CAGGTTCTGG CTAAAC CAGG TTCAATCAAA 600 

CCACACACCA AGTTTGATTC CGAAGTGTAC ATCCTGAGCA AAGATGAAGG 650 

TGGTCGTCAC ACTCCATTCT T CAAAGGCT A CCGTCCACAG TTCTACTTCC 7 00 

GTACAACTGA CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTTATGCCTG GTGACAACGT GAACATGGTT GTTACCCTGA TCCACCCAAT 8 00 

60 CGCGATGGAT GACGGTCTGC GTTTC 825 
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2) INFORMATION FOR SEQ ID NO: 158 

5 

( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Salmonella choleraesuis subsp. arizonae 

(B) STRAIN: ATCC 13314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158 

20 CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGC CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 200 

ATACGC CGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

25 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ACTCTTACAT 3 00 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 450 

GAC TCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 50 0 

30 ACGAAGGCCG TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 55 0 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTT CGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTGGAGAT 750 

35 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA 83 0 



40 2) INFORMATION FOR SEQ ID NO: 159 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Choleraesuis 

(B) STRAIN: ATCC 7001 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGYCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

60 TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 
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ACACGCCGAT 
GAGTGGGAAG 
TCCGGAACCA 
ACGTATTCTC 
CGCGGTATCA 
GACTCAGAAG 
ACGAAGGCCG 
CGTGAAGAAA 
GCCGCACACC 
GCGGCCGTCA 
CGTACTACTG 
GGTAATGCCG 
TCGCAATGGA 



CGTTCGTGGT 
CGAAAATCAT 
GAGCGTGCGA 
CATCTCCGGT 
TCAAAGTGGG 
TCTACCTGTA 
TGCCGGTGAG 
TCGAACGTGG 
AAGTTCGAAT 
TACTCCGTTC 
ACGTGACTGG 
GGCGACAACA 
CGACGGTCTG 



TCTGCTCTGA 
CGAACTGGCT 
TTGACAAGCC 
CGTGGTACCG 
YGAAGAAGTT 
CTGGCGTTGA 
AACGTAGGTG 
TCAGGTACTG 
CTGAAGTGTA 
TTCAAAGGCT 
CACCATCGAA 
TCAAAATGGT 
CGTTTCGCAA 



AAGCGCTGGA AGGTGACGCA 
GGCTTCCTGG ATTCTTACAT 
GTTCCTGCTG CCGATCGAAG 
TTGTTACCGG TCGTGTAGAG 
GAAATCGTTG GTATCAAAGA 
AATGTTCCGC AAACTGTTGG 
TTCTGCTGCG TGGTATCAAA 
GCTAAGCCGG GCACCATCAA 
CATTCTGTCC AAAGATGAAG 
ACCGTCCGCA GTTCTACTTC 
CTGCCGGAAG GCGTAGAGAT 
TGTTACCCTG ATCCACCCGA 
TC 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
832 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 160 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. dlarizonae 

(B) STRAIN: ATCC 43973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 

CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATTATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACGCCAA 2 00 

TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 2 50 

GCGAAAATCA TCGAACTGGC TGGCTTCCTG GATTCTTACA TCCCGGAACC 3 00 

AGAGCGTGCG ATTGACAAGC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CCATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 4 00 

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 4 50 

GTCTACCTGT ACTGGCGTTG AAATGTTCCG CAARCTGCTG GACGAAGGCC 500 

GTGCWGGTGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGC TAAGCCG GGCACCATCA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ATATTCTGTC CAAAGATGAA GGCGGCCGTC 650 

ATACTCCGTT CTTCAAAGGC TACCGTCCGC AGTTCTACTT CCGTACCACT 700 

GACGTGACTG GCACCATCGA AC TGC CGGAA GGCGTGGAGA TGGTAATGCC 750 

GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 800 

ACGACGG 8 07 



2) INFORMATION FOR SEQ ID NO: 161 

( i ) SEQUENCE CHARACTERISE CS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella, choleraesuis subsp. choleraesuxs 
serotype Heidelberg 

(B) STRAIN: ATCC 8326 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161 

CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 
GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 
TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 
TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 
ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 
GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 
CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 
ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 
CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATC GTTG GTATCAAAGA 
GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 
ACGAAGGCCG TGC CGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 
CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 
GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 
GCGGCCGTCA YACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 
CGTACTACTG ACGTGACTGG C AC CAT CGAA TTGCCGGAAG GCGTAGAGAT 
GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 
TCGCGATGGA CGACGGTCTG CGTTTCGCAA TC 



50 
100 
150 
200 
250 
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350 
400 
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750 
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2) INFORMATION FOR SEQ ID NO: 162 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. houtenae 

(B) STRAIN: ATCC 43974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162 

CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGATTT CCCGGGCGAC GACACGCCGA 2 00 

TCGTGCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAATGGGAA 250 

GCGAAAATCA TCGAACTGGC TGGCTACCTG GATTCTTACA TCCCGGAACC 300 

AGAGCGTGCG ATTGACAAAC CGTTCCTGCT GCCGATCGAA GACGTATTCT 350 

CTATCTCCGG TCGTGGTACC GTTGTTACCG GTCGTGTAGA GCGCGGTATC 400 

ATCAAAGTGG GCGAAGAAGT TGAAATCGTT GGTATCAAAG AGACTCAGAA 450 

GTCGACCTGT ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC 500 

GTGCTGGCGA GAACGTAGGT GTTCTGCTGC GTGGTATCAA ACGTGAAGAA 550 

ATCGAACGTG GTCAGGTACT GGCTAAGCCG GGCAC CAT CA AGCCGCACAC 600 

CAAGTTCGAA TCTGAAGTGT ACATTCTGTC CAAAGATGAA GGCGGCCGTC 650 

ATACTC CGTT CTTCAAAGGC TACCGTCCGC AATTCTACTT CCGTACGACT 700 

GACGTGACTG GCAC CAT CG A ACTGCCGGAA GGCGTGGAGA TGGTAATGCC 750 
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GGGCGACAAC ATCAAAATGG TTGTTACCCT GATCCACCCG ATCGCGATGG 
ACGACGG 



800 
807 
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2) INFORMATION FOR SEQ ID NO: 163 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. indlca 

(B) STRAIN: ATCC 43976 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163 

CGCGATCCTG GTTGTTGCTG CGACTGACGG CCCGATGCCG CAGACCCGTG 50 

AGCACATCCT GCTGGGTCGT CAGGTAGGCG TTCCGTACAT CATCGTGTTC 100 

CTGAACAAAT GCGACATGGT TGATGACGAA GAGCTGCTGG AACTGGTTGA 150 

AATGGAAGTT CGTGAACTTC TGTCTCAGTA CGACTTCCCG GGTGACGACA 2 00 

CGCCGATCGT GCGTGGTTCT GCTCTGAAAG CGCTGGAAGG CGACGCAGAG 250 

TGGGAAGCGA AAATCATCGA ACTGGCTGGC TTCCTGGATT CTTACATTCC 3 00 

GGAACCAGAG CGTGCGATTG ACAAGCCGTT CCTGCTGCCG ATCGAAGACG 350 

TATTCTCCAT CTCCGGTCGT GGTACYGTTG TTACCGGTCG TGTAGAGCGC 4 00 

GGTATCATCA AAGTGGGCGA AGAAGTTGAA ATCGTTGGTA TCAAAGAGAC 4 50 

TCAGAAGTCT ACCTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGACG 500 

AAGGCCGTGC CGGTGAGAAC GTAGGTGTTC TGCTGCGTGG TATCAAACGT 550 
GAAGAAATCG AACGTGGTCA GGTACTGGCT AAGCCGGGCA CCATCAAGCC 6 00 

GCACACCAAG TTCGAATCTG AAGTGTACAT TCTGTCCAAA GATGAAGGCG 650 

GCCGTCATAC TCCGTTCTTC AAAGGCTACC GTCCGCAGTT CTACTTCCGT 700 

ACTACTGACG TGACTGGCAC CATCGAACTG CCGGAAGGCG TAGAGATGGT 750 

AATGCCGGGC GACAACATCA AAATGGTTGT TACCCTGATC CATCCGATCG 800 
CRATGGACGA CGGTCTGCGT TTCGCAA 827 



2) INFORMATION FOR SEQ ID NO: 164 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. salamae 

(B) STRAIN: ATCC 43972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 

CCTGGTTGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACC CGTGAGCACA 50 
TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 
AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TAGAAATGGA 150 
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10 



15 



AGTTCGTGAA 
TCGTGCGTGG 
GMGAAAATCA 
AGAGCGTGCG 
CCATCTCCGG 
ATCAAAGTGG 
GTCTACCTGT 
GTGCCGGTGA 
ATCGAACGTG 
CAAGTTCGAA 
ATACTCCGTT 
GACGTGACTG 
GGGCGACAAC 
ACGACGG 



CTTCTGTCTC 
TTCCGCTCTG 
TCGAACTGGC 
ATTGACAAGC 
TCGTGGTACC 
GCGAAGAAGT 
ACTGGCGTTG 
GAACGTAGGT 
GTCAGGTACT 
TCTGAAGTGT 
CTTCAAAGGC 
GCACCATCGA 
ATCAAAATGG 



AGTACGACTT 
AAAGCGCTGG 
TGGCTWCCTG 
CGTTCCTGCT 
GTTGTTACCG 
TGAAATCGTT 
AAATGTTCCG 
GTTCTGCTGC 
GGCTAAGCCG 
ACATTCTGTC 
TACCGTCCGC 
ACTGCCGGAA 
TTGTTACCCT 



CCCGGGCGAC 
AAGGCGAMGC 
GATTCTTACA 
GCCGATCGAA 
GTCGTGTAGA 
GGTATCAAAG 
CAAACTGCTG 
GTGGTATCAA 
GGCACCATCA 
CAAAGATGAA 
AGTTCTACTT 
GGCGTGGAGA 
GATCCACCCG 



GACACGCCGA 2 00 

TGAGTGGGAA 2 50 

TTCCGGAACC 3 00 

GACGTATTCT 3 50 

GCGCGGTATC 4 00 

AGAC TCAGAA 4 50 

GACGAAGGCC 500 

ACGTGAAGAA 550 

AGCCGCACAC 6 00 

GGCGGCCGTC 650 

CCGTACCACT 700 

TGGTAATGCC 750 

ATCGCGATGG 8 00 

807 



2) INFORMATION FOR SEQ ID NO: 165 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp . choleraesuis 
30 serotype Typhi 

(B) STRAIN: ATCC 10749 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165 

35 CGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 

ACACGCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

4 0 GAGTGGGAAG CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

CCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 3 50 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 4 00 

CGCGGTATCA TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

45 ACGAAGGCCG TGCNGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCAT CAA 6 00 

GCCGCACACY AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

CGTACTACTG ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

50 GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 800 

TCGCAATGGA CGACGGTCTG CGTTTCGCAA TC 832 



55 2 ) INFORMATION FOR SEQ ID NO: 166 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia fonticola 

(B) STRAIN: DSM 4576 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166 
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CGGCGCTATC 
GTGAGCACAT 
TTCATGAACA 
AGAAATGGAA 
ACCTGCCGGT 
GAGTGGGAAG 
CCCAGAACCA 
ACGTATTCTC 
CGCGGTATCG 
CACCGTTAAG 
ACGAAGGCCG 
CGTGAAGACA 
GCCGCACACT 
GTGGTCGTCA 
CGTACAACTG 
GGTAATGCCA 
TCGCTATGGA 



CTGGTTGTAG 
CCTGCTGGGT 
AATGCGACAT 
GTTCGTGAAC 
TGTTCGTGGT 
CTAAAATCAT 
GAGCGTGCTA 
CATCTCCGGT 
TTAAAGTTGG 
TCTACCTGTA 
TGCTGGTGAG 
TCGAACGTGG 
CAGTTCGATT 
TACTCCATTC 
ACGTGACCGG 
GGCGATAACG 
CCAAGGC 



CTGCGACTGA 
CGTCAGGTTG 
GGTTGATGAT 
TTCTGTCTGC 
TCAGCGCTGA 
CGAGCTGGCC 
TCGATCAGCC 
CGTGGTACCG 
CGAAGAAGTT 
CTGGCGTTGA 
AACGTTGGTG 
TCAGGTACTG 
CAGAAGTGTA 
TTCAAAGGCT 
TACCATCGAA 
TGAACATGGT 



CGGCCCTATG 
GCGTTCCTTA 
GAAGAGCTGC 
TTATGACTTC 
AAGCACTGGA 
GGTCACCTGG 
GTTCCTGCTG 
TAGTTACCGG 
GAAATCGTTG 
AATGTTCCGC 
TTCTGCTGCG 
GCTAAACCAG 
TATCCTGAGC 
ACCGTCCACA 
CTGCCAGAAG 
TGTTACCCTG 



CCTCAGACTC 50 

CATCATCGTA 10 0 

TGGAACTGGT 150 

CCTGGTGATG 200 

AGGCGAAGCT 250 

ATTCCTACAT 3 00 

CCAATCGAAG 3 50 

TCGTGTTGAG 4 00 

GTATCAAAGA 450 

AAACTGCTGG 500 

TGGTATCAAG 550 

GTTCCATCAA 60 0 

AAAGAAGAAG 650 

GTTCTACTTC 700 

GCGTAGAGAT 750 

ATCCACCCAA 800 

817 



30 



40 



2) INFORMATION FOR SEQ ID NO: 167 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 787 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 27592 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167 

GCTGCGACTG ACGGCCCAAT GCCTCAGACC CGTGAGCACA TCCTGCTGGG 50 

TCGTCAGGTT GGCGTTCCTT TCATCATCGT ATTCATGAAC AAATGCGACA 10 0 

TGGTTGATGA TGAAGAGCTG CTGGAACTGG TAGAAATGGA AGTTCGTGAA 150 

50 CTTCTGTCTG CTTACGACTT CCCTGGTGAT GACCTGCCGG TTGTTCGTGG 2 00 

TTCAGCGCTG AAAGCAC TGG AAGGCGAAGC TGAGTGGGAA GCTAAAATCA 250 

TCGAGCTGGC CGGTTACCTG GATTCTTACA TCCCAGAACC AGAGCGTGCT 3 00 

ATCGACAAGC CGTTCCTGCT GCCAATCGAA GACGTCTTCT CCATCTCCGG 3 50 

TCGTGGTACC GTTGTTACCG GTCGTGTTGA GCGCGGTATC GTTAAAGTTG 4 00 

55 GCGAAGAAGT TGAAATCGTT GGTATCAAAG ACACCGTTAA GTCTACCTGT 450 

ACTGGCGTTG AAATGTTCCG CAAACTGCTG GACGAAGGCC GTGCTGGTGA 500 

GAACGTTGGT GTTCTGCTGC GTGGTATCAA GCGTGAAGAC ATCGAACGTG 550 

GTCAGGTACT GGCTAAACCA GGTTCAATCA AGCCACACAC CAAGTTCGAC 600 

TCAGAAGTGT ACATCCTGAG CAAAGAAGAA GGTGGTCGTC ATACTCCATT 650 

60 CTTCAAAGGC TACCGTCCAC AGTTCTACTT CCGTACAACT GACGTGACCG 700 
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GTACCATCGA ACTGCCAGAA GGCGTTGAAA 
GTGAACATGG TTGTTACCCT GATCCACCCA 



TGGTAATGCC AGGTGACAAC 
ATCGCGA 
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2) INFORMATION FOR SEQ ID NO: 168 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 745 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13880 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 

GCCTCAGACT CGTGAGCACA TCCTGCTGGG TCGTCAGGTT GGCGTTCCTT 50 

TCATCATCGT ATTCATGAAC AAATGCGACA TGGTTGATGA TGAAGAGCTG 100 

YTGGAACTGG TAGAAATGGA AGTTCGCGAA CTGCTGTCCG CTTACGACTT 150 

CCCTGGCGAC GACCTGCCGG TAATCCGCGG TTCCGCGCTG AAAGCGCTGG 2 00 

AAGGCGAAGC TGAGTGGGAA GCGAAAATCA TCGAACTGGC CGAAGCCCTG 2 50 

GACAGCTACA TCCCAGAGCC AGAGCGTGCT ATCGACAAGC CGTTCCTGCT 3 00 

GCCAATCGAA GACGTATTCT CCATCTCCGG TCGTGGTACC GTTGTTACCG 350 

GTCGTGTTGA GCGCGGCATC ATCAAAGTTG GCGAAGAAGT TGAAATCGTT 4 00 

GGTATCAAAG ACACCGTTAA GTCTACCTGT ACTGGCGTTG AAATGTTCCG 4 50 

CAAACTGCTG GACGAAGGCC GTGCTGGTGA GAACGTAGGT GTTCTGCTGC 500 

GTGGTAT CAA ACGTGAAGAA ATCGAACGTG GTCAGGTACT GGCTAAGCCA 550 

GGCTCCATCA AGCCGCACAC CCAGTTCGAA TCTGAAGTGT ACATCCTGAG 600 

CAAAGATGAA GGTGGTCGTC ACACKCCATT CTTCAAAGGC TACCGTCCAC 650 

AGTTCTACTT CCGTACCACT GACGTGACCG GTACCATCGA ACTGCCAGAA 700 

GGCGTAGAGA TGGTAATGCC AGGCGACAAC GTGAACATGG TTGTA 74 5 



2) INFORMATION FOR SEQ ID NO: 169 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia odor if era 

(B) STRAIN: ATCC 3 3077 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169 

GGCGCAATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCTTTC ATCATCGTGT 100 

TCATGAACAA ATGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGCGAGCT GCTGTCTGCT TACGATTTCC CTGGCGACGA 2 00 

CCTGCCAGTA ATCCGCGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCAG 250 
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AGTGGGAAGC 
CCAGAACCAG 
CGTATTCTCC 
GCGGTATCAT 
ACCGTTAAGT 
CGAAGGCCGT 
GTGAAGACAT 
CCGCACACCA 
TGGTCGTCAC 
GTACTACTGA 
GTAATGCCAG 
CGCAATGGAC 



TAAGATTGTA 
AGCGTGCTAT 
ATCTCCGGTC 
CAAAGTTGGC 
CTACCTGTAC 
GCTGGTGAGA 
CGAACGTGGT 
AATTCGACTC 
ACGCCATTCT 
CGTGACCGGT 
GCGATAACGT 
GACGGTCTGC 



GAACTGGCTG 
CGACAAGCCG 
GTGGTACCGT 
GAAGAAGTTG 
CGGTGTAGAA 
ACGTTGGTGT 
CAGGTTCTGG 
AGAAGTGTAC 
TCAAAGGCTA 
ACCATCGAAC 
GAACATGGTT 
GTTTCGCAA 



AAGCGCTGGA TTCTTACATC 
TTCCTGCTGC CAATCGAAGA 
TGTTACCGGT CGTGTTGAGC 
AAATCGTTGG TATCAAAGAC 
ATGTTCCGCA AACTGCTGGA 
TCTGCTGCGT GGTATCAAGC 
CTAAACCAGG TTCTATCAAG 
AT CC TGAGCA AAGAAGAAGG 
CCGTCCACAG TTCTACTTCC 
TGCCAGAAGG CGTAGAGATG 
GTTACCCTGA TTCACCCAAT 



300 
350 
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2) INFORMATION FOR SEQ ID NO: 170 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia plymuthica 

(B) STRAIN: DSM 4540 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170 

CGGCGCAATC CTGGTTGTTG CTGCGACTGA CGGCCCAATG CCTCAGACCC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTTG GCGTTCCTTT CATCATCGTA 100 

TTCATGAACA AATGCGACAT GGTTGATGAT GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTGC TTACGACTTC CCTGGTGATG 2 00 

ACCTGCCGGT TGTTCGTGGT TCAGCGCTGA AAGCACTGGA AGGCGAACCA 2 50 

GAGTGGGAAG CTAAAATCAT CGAGCTGGCT GGTTTCCTGG ATTCTTACAT 3 00 

CCCAGAACCA GAGCGTGCTA TCGACAAGCC GTTCCTGCTG CCAATCGAAG 3 50 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTTGAG 4 00 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTGG GTATCAAAGA 450 

CACCGTTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTGGGTG TTCTGCTGCG TGGTAT C AAG 550 

CGCGAAGATA TCGAACGTGG TCAGGTCCTG GCTAAACCAG GTTCAATCAA 600 

GCCACACACC AAGTTTGACT CAGAAGTGTA CATCCTGAGC AAAGAAGAAG 650 

GTGGTCGTCA TACTCCATTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG TACCAT CGAA CTGCCAGAAG GCGTAGAGAT 750 

GGTAATGCCA GGTGACAACG TGAACATGGT TGTAACCCTG ATCCACCCAA 800 

TCGCGATGGA CGACGGCCTG CGTTTCGCAA 83 0 



2) INFORMATION FOR SEQ ID NO: 171 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATCC 27593 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171 



GGCGCAATCC TGGTAGTAGC AGCGACTGAC GGCCCAATGC CTCAGACCCG 50 

TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTACCTTAC ATCATCGTAT 100 

10 TCATGAACAA ATGCGACATG GTAGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGCGAACT GCTGTCTGCT TACGACTTCC CAGGCGACGA 2 00 

CCTGCCGGTA ATCCGTGGTT CCGCGCTGAA AGCGCTGGAA GGCGAAGCCG 250 

AGTGGGAAGC GAAAATCGTT GAGCTGGCAG AAGCGCTGGA CAGCTACATC 300 

CCAGAGCCAG AGCGTGCTGT AGACAAGCCG TTCCTGCTGC CAATCGAAGA 350 

15 CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 4 00 

GCGGTATCAT CAAAGTTGGT GAAGAAGTAG AAATCGTAGG TATCAAAGAC 4 50 

ACCGTTAAGT CTACCTGTAC TGGCGTAGAA ATGTTCCGCA AACTGCTGGA 500 

CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAGC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CGAAGCCAGG TTCAATCAAG 600 

20 CCGCACACCC AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 7 00 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 750 

GTAATGCCAG GCGACAACGT GAACATGAAA GTTACTCTGA TTCACCCAAT 8 00 

CGCAATGGAC GACGGTCTGC GTTTCGCAA 82 9 

25 



2) INFORMATION FOR SEQ ID NO: 172 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella boydii 
40 (B) STRAIN: ATCC 9207 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172 

CGGCGCGATC YTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

4 5 GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTACAT 300 

50 TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 400 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

55 CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 700 

CGTACTACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA 8 00 

SO TCGCGATGGA CGACGGTCTG CGTTTC 826 
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2) INFORMATION FOR SEQ ID NO: 173 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Shigella dysenteriae 

(B) STRAIN: ATCC 11835 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173 

2 0 TGGTAGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC 50 

CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG 150 

TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC 2 00 

GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 2 50 

25 GAAAATCCTG GAACTGGCTG GCTTCCTGGA TTCYTAYATT CCGGAACCAG 3 00 

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 3 50 

ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT 4 00 

CAAAGTTGGT GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACYCAGAAGT 450 

CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 500 

30 GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550 

CGAACGTGGT CAGGTACTGG CGAAGCCRGG CACCATCAAG CCGCACACCA 600 

AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT 650 

ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 700 

CGTGACTGGT ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750 

3 5 GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCGATGGAC 8 00 

GACGGTCTGC GTTTCGCA 818 



40 2) INFORMATION FOR SEQ ID NO: 174 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 06 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexneri 

(B) STRAIN: ATCC 12 022 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174 



CCTGGTAGTT GCTGCGACTG ACGGCCCGAT GCCGCAGACT CGTGAGCACA 50 

TCCTGCTGGG TCGTCAGGTA GGCGTTCCGT ACATCATCGT GTTCCTGAAC 100 

AAATGCGACA TGGTTGATGA CGAAGAGCTG CTGGAACTGG TTGAAATGGA 150 

AGTTCGTGAA CTTCTGTCTC AGTACGACTT CCCGGGCGAC GACACTCCGA 200 

60 TCGTTCGTGG TTCTGCTCTG AAAGCGCTGG AAGGCGACGC AGAGTGGGAA 250 
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GCGAAAATCC 
AGAGCGTGCG 
CCATCTCCGG 
ATCAAAGTTG 
GTCTACCTGT 
GTGC TGGTGA 
ATCGAACGTG 
CAAGTTCGAA 
ATACTCCGTT 
GACGTGACTG 
GGGCGACAAC 
ACGACG 



TGGAACTGGC 
ATTGACAAGC 
TCGTGGTACC 
GTGAAGAAGT 
ACTGGCGTTG 
GAACGTAGGT 
GTCAGGTACT 
TCTGAAGTGT 
CTTCAAAGGC 
GTACCATCGA 
ATCAAAATGG 



TGGCTTCCTG 
CGTTCCTGCT 
GTTGTTACCG 
TGAAATCGTT 
AAATGTTCCG 
GTTCTGCTGC 
GGCTAAGCCG 
ACATTCTGTC 
TACCGTCCGC 
ACTGCCGGAA 
TTGTTACCCT 



GATTCTTACA TTCCGGAACC 
GCCGATCGAA GACGTATTCT 
GTCGTGTAGA ACGCGGTATC 
GGTATCAAAG AGACTCAGAA 
CAAACTGCTG GACGAAGGCC 
GTGGTATCAA ACGTGAAGAA 
GGCACCATCA AGCCGCACAC 
CAAAGATGAA GGCGGMCGTC 
AGTTCTACTT CCGTACTACT 
GGCGTAGAGA TGGTAATGCC 
GATCCACCCG ATCGCGATGG 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
806 



10 



15 



20 



25 



30 



35 



40 



45 



50 
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2) INFORMATION FOR SEQ ID NO: 175 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella sonnei 

(B) STRAIN: ATCC 29930 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175 

CGGCGCGATC CTGGTAGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

TGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGACTTC CCGGGCGACG 2 00 

ACACTCCGAT CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA 250 

GAGTGGGAAG CGAAAATCCT GGAACTGGCT GGCTTCCTGG ATTCTTACAT 3 00 

TCCGGAACCA GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG 350 

ACGTATTCTC CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA 4 00 

CGCGGTATCA TCAAAGTTGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

GACTCAGAAG TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGG CCG TGCTGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA 600 

GCCGCACACC AAGTT CGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG 650 

GCGGYCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

CGTACTACTG ACGTGACTGG TAC CATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTAC CCTG ATCCACCCGA 8 00 

TCGCGATGGA CGACGGTCTG CGTTTCGCAA TC 832 



2) INFORMATION FOR SEQ ID NO: 176 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 716 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 13301 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176 

TCTGCTGCTG ACGGTCCAAT GCCACAAACT CGTGAACACA TTCTTTTATC 50 

ACGTAACGTT GGTGTACCAG CATTAGTAGT ATTCTTAAAC AAAGTTGACA 100 

10 TGGTTGACGA TGAAGAATTA TTAGAATTAG TAGAAATGGA AGTTCGTGAC 150 

TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG TAATCGCTGG 200 

TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA GAAAAAATCT 2 50 

TAGAATTAAT GGAAGCTGTA GATACTTACA TTCCAACTCC AGAACGTGAT 300 

TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT CAATCACTGG 350 

15 TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA ATCAAAGTTG 400 

GTGAAGAAGT TGAAATCATC GGTTTACATG ACACATCTAA AACAACTGTT 450 

ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG AAGCTGGTGA 500 

CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC GTACAACGTG 550 

GTCAAGTATT AGCTGCTCCT GGTTCAATTA CACCACATAC TGAATTCAAA 600 

2 0 GCAGAAGTAT ACGTATTATC AAAAGACGAA GGTGGACGTC ACACTCCATT 650 

CTTCTCAAAC TATCGTCCAC AATTCTATTT C CGT AC TACT GACGTAACTG 700 

GTGTTGTTCA CTTACC 716 



25 



35 



2) INFORMATION FOR SEQ ID NO: 177 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 719 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 29247 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 177 

TTCTTTTATC ACGTAACGTT GGTGTACCAG CATTAGTAGT ATTCTTAAAC 50 

AAAGTTGACA TGGTTGACGA TGAAGAATTA TTAGAATTAG TAGAAATGGA 100 

AGTTCGTGAC TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG 150 

45 TAATCGCTGG TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA 200 

GAAAAAATCT TAGAATTAAT GGAAGCTGTA GATACTTACA TTCCAACTCC 250 

AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 3 00 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 350 

ATCAAAGTTG GTGAAGAAGT TGAAATCATC GGTTTACATG ACACATCTAA 4 00 

50 AACAACTGTT ACAGGTGTTG AAATGTTCCG TAAATTATTA GACTACGCTG 450 

AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC 500 

GTACAACGTG GTCAAGTATT AGCTGCTCCT GGTTCAATTA CACCACATAC 550 

TGAATTCAAA GCAGAAGTAT ACGTATTATC AAAAGACGAA GGTGGACGTC 600 

ACACTCCATT CTTCTCAAAC TATCGTCCAC AATTCTATTT CCGTACTACT 650 

55 GACGTAACTG GTGTTGTTCA CTTACCAGAA GGTMCTGAAA TGGTAATGCC 700 

TGGTGATAAC GTTGAAATG 719 



60 2) INFORMATION FOR SEQ ID NO: 178 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 62 5 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 33591 



15 



30 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178 



GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 50 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 100 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 150 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 2 00 

2 0 CAATACGAAG AAAAAATCTT AGAATTAATG GAAGCTGTAG ATACTTACAT 250 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 300 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 3 50 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 4 00 

CACATCTAAA ACAACTGTTA CAGGTGTTGA AATGTTCCGT AAATTATTAG 450 

2 5 ACTACGCTGA AGCTGGTGAC AACATTGGTG CATTATTACG TGGTGTTGCT 500 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 550 

ACCACATACT GAATTCAAAG CAGAAGTATA CGTATTATCA AAAGACGAAG 600 

GTGGACGTCA CACTCCATTC TTCTC 62 5 



2) INFORMATION FOR SEQ ID NO: 179 



(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 704 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 ( i i ) MOLECULE TYPE : Genomi c DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 43300 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179 



GTTGGTGTAC CAGCATTAGT AGTATTCTTA AACAAAGTTG ACATGGTTGA 50 

CGATGAAGAA TTATTAGAAT TAGTAGAAAT GGAAGTTCGT GACTTATTAA 100 

50 GCGAATATGA CTTCCCAGGT GACGATGTAC CTGTAATCGC TGGTTCAGCA 150 

TTAAAAGCTT TAGAAGGCGA TGCTCAATAC GAAGAAAAAA TCTTAGAATT 20 0 

AATGGAAGCT GTAGATACTT ACATTCCAAC TCCAGAACGT GATTCTGACA 2 50 

AACCATTCAT GATGCCAGTT GAGGACGTAT TCTCAATCAC TGGTCGTGGT 3 00 

ACTGTTGCTA CAGGCCGTGT TGAACGTGGT CAAATCAAAG TTGGTGAAGA 35 0 

55 AGTTGAAATC ATCGGTTTAC ATGACACATC TAAAACAACT GTTACAGGTG 4 00 

TTGAAATGTT CCGTAAATTA TTAGACTACG CTGAAGCTGG TGACAACATT 450 

GGTGCATTAT TACGTGGTGT TGCTCGTGAA GACGTACAAC GTGGTCAAGT 500 

ATTAGCTGCT CCTGGTTCAA TTACACCACA TACTGAATTC AAAGCAGAAG 550 

TATACGTATT ATCAAAAGAC GAAGGTGGAC GTCACACTCC ATTCTTCTCA 600 

60 AACTATCGTC CACAATTCTA TTTCCGTACT ACTGACGTAA CTGGTGTTGT 650 
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TCACTTACCA GAAGGTACTG AAATGGTAAT GCCTGGTGAT AACGTTGAAA 
TGAC 



700 
704 
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2) INFORMATION FOR SEQ ID NO: 18 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus subsp. aureus 

(B) STRAIN: ATCC 653 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGT AC CAGC ATTAGTAGTA 50 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 100 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 150 

ATGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 200 

CAATACGAAG AAAAAATCTT AGAATTAATG GAAGCTGTAG ATACTTACAT 250 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 3 00 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 3 50 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 4 00 

CACATC TAAA ACAACTGTTA CAGGTGTTGA AATGTT CCGT AAATTATTAG 4 50 

ACTACGCTGA AGCTGGTGAC AACATTGGTG CATTATTACG TGGTGTTGCT 500 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 550 

ACCACATACT GAATTCAAAG CAGAAGTATA CGTATTATCA AAAGACGAAG 600 

GTGGACGTCA CACTCCATTC TTCTCAAACT ATCGTCCACA ATTCTATTTC 650 

CGTACTACTG ACGTAACTGG TGTTGTTCAC TTACCAGAAG GTACTGAAAT 700 

GGTAATGCCT GGTGATAACG TTGAAATGAC 730 



2) INFORMATION FOR SEQ ID NO: 181 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus auricular is 

(B) STRAIN: ATCC 33753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181 

CGGTGCGATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACCA AGTTGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATACGACTAC CCAGGTGACG 2 00 

ATGTACCTGT AATCTCTGGT TCTGCGTTGA AAGCATTAGA AGGCGACAAA 2 50 
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GAATACGAAC 
TCCAACTCCA 
ACGTATTCTC 
CGTGGTCAAA 
CGGTTCACAA 
TAGACTACGC 
TCACGTGAAG 
TACACCACAC 
AAGGTGGACG 
TTCCGTACTA 
AATGGTAATG 
CAATCGCTAT 



AAAAAATCTT 
GAACGTGACT 
AATCACTGGT 
TCAAAGTCGG 
AAAACAACAG 
TGAAGCTGGT 
AAGTACAACG 
ACTAAATTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGCGATA 
CGAAGACGGT 



AGACTTAATG 
CTGATAAACC 
CGTGGTACTG 
TGAAGAAGTT 
TTACTGGTGT 
GACAACATCG 
TGGTCAAGTT 
CTGCAGAAGT 
TTCTTCTCTA 
AGGTGTTGTT 
ACGTTAAAAT 
ACTCGTTTCT 



CAACAAGTTG ACGATTACAT 
ATTCATGATG CCAGTTGAAG 
TTGCAACAGG CCGTGTTGAA 
GAAATCATCG GTATGAAAGA 
AGAAATGTTC CGTAAATTAT 
GTGCTTTATT ACGTGGTATT 
TTAGCTGCTC CTGGTTCAAT 
TTACGTATTA TCTAAAGATG 
ACTACCGTCC ACAATTCTAT 
ACTTTACCAG AAGGTACAGA 
GGAAGTTGAA TTAATTTCTC 
CAAT 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
834 
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2) INFORMATION FOR SEQ ID NO: 182 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis 

(B) STRAIN: ATCC 27840 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 

CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTC CAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTC TTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGATG 2 00 

ATGTAC CTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGATGCT 2 50 

CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGATTACAT 3 00 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 40 0 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTATCCACGA 450 

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATCAC 600 

ACCACACACT AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 650 

GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTTAAC TTAC CAGAAG GTACTGAAAT 750 

GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCTA 800 

TCGCTATTGA AGACGGTACT CGTTTCTCAA TCGGA 83 5 



2) INFORMATION FOR SEQ ID NO: 183 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Micrococcus caseolyticus 

(B) STRAIN: ATCC 13548 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183 



GTATCTTAGT AGTATCTGCT GCTGACGGTC CAATG C C AC A AACTCGTGAA 50 

CACATCCTTT TATCACGTAA CGTTGGTGTA CCAGCATTAG TAGTATTCTT 10 0 

10 GAACAAAGTT GACATGGTTG ACGATGAAGA ATTATTAGAA TTAGTTGAAA 150 

TGGAAGTTCG TGACTTATTA TCTGAATATG ACTTCCCTGG TGACGATGTA 2 00 

CCTGTAATCG CTGGATCTGC TTTAAAAGCA TTAGAAGGCG TTGAAGAATA 2 50 

CGAAGACAAA ATCATGGAAT TAATGGACGC AGTTGATGAG TACATCCCAA 300 

CTCCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT TGAGGACGTA 350 

15 TTCTCAATCA CTGGTCGTGG TACAGTTGCA ACTGGACGTG TTGAGCGTGG 4 00 

ACAAGTTAAA GTTGGTGAAG AAGTTGAAAT CATTGGTTTA ACTGAAGAAC 450 

CAGCAAAAAC TACAGTTACA GGTGTAGAAA TGTTCCGTAA ATTATTAGAT 500 

TACGCTGAAG CTGGAGATAA CATCGGTGCT TTATTACGTG GTGTTTCTCG 55 0 

TGAAGACGTA CAACGTGGAC AAGTATTAGC TAAACCAGGT TCAATTACTC 60 0 

2 0 CACATACTAA ATTCAAAGCT GAAGTTTACG TATTATCTAA AGAAGAAGGT 65 0 

GGACGTCATA CTCCATTCTT CACTAACTAC CGCCCTCAGT TCTACTTCCG 700 

TACAACTGAC GTAACTGGTG TAGTTAACTT ACCAGAAGGT ACTGAAATGG 75 0 

TAATGCCTGG AGATAACATC GAAATGAACG TTGAATTAAT TTCTCCAATC 8 00 

GCGA 804 

25 



2) INFORMATION FOR SEQ ID NO: 184 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 2 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 
40 (B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184 

CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 50 

4 5 GTGAACATAT CCTTTTATCA CGTAACGTTG GTGTTCCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 2 00 

ATGTACCTGT AATCTCTGGT TCAGCATTAA AAGCTCTTGA AGGCGACGCT 250 

GACTATGAGC AAAAAATCTT AGACTTAATG CAAGCTGTTG ATGACTTCAT 300 

50 TCCAACACCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG GCGTGTTGAA 40 0 

CGTGGTCAAA TCAAAGTCGG TGAAGAAGTT GAAATCATCG GTATGCAAGA 450 

AGATTCAAGC AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT 50 0 

TAGACTACGC TGAAGCTGGT GACAACATTG GTGCGTTATT ACGTGGTGTT 55 0 

55 GCACGTGAAG ACATCCAACG TGGTCAAGTT TTAGCTGCTC CTGGTTCAAT 600 

TACACCACAC ACAAACTTTA AAGCGGAAGT TTACGTTTTA TCAAAAGATG 650 

AAGGTGGCCG TCATACGCCA TTCTTCAGTA ACTATCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC AGGTGTTGTT ACTTTACCAG AAGGTACTGA 750 

AATGGTTATG CCTGGCGACA ACGTAGAAAT GGAAGTTGAA CTAATTTCTC 800 

SO CAATCGCTAT CGAAGACGGT ACACGTTTCT CT 832 
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2) INFORMATION FOR SEQ ID NO: 185 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 99 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 



20 



25 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: CSG 269 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 185 



ATCTGCTGCT 
CACGTAACGT 
ATGGTAGACG 
CTTATTAAGC 
GTTCTGCATT 
TTAGACTTAA 
TTCTGACAAA 
GTCGTGGTAC 
GGTGAAGAAG 
TACTGGTGTA 
ACAACATCGG 
GGTCAAGTAT 
AGCTGAAGTA 
TCTTCACTAA 



GACGGTCCAA 
TGGTGTACCA 
ACGAAGAATT 
GAATATGACT 
AAAAGCATTA 
TGCAAGCAGT 
CCATTCATGA 
TGTTGCTACA 
TTGAAATCAT 
GAAATGTTCC 
TGCTTTATTA 
TAGCTGCTCC 
TACGTATTAT 
CTATCGCCCA 



TGCCACAAAC 
GCATTAGTTG 
ATTAGAATTA 
TCCCAGGTGA 
GAAGGCGATG 
TGATGATTAC 
TGCCAGTTGA 
GGCCGTGTTG 
CGGTATGCAC 
GTAAATTATT 
CGTGGTGTTG 
TGGTTCTATT 
CTAAAGATGA 
CAATTCTATT 



TCGTGAACAC 
TATTCTTAAA 
GTTGAAATGG 
CGATGTACCT 
CTGAATACGA 
ATTCCAACTC 
GGACGTATTC 
AACGTGGTCA 
GAAACTTCTA 
AGACTACGCT 
CACGTGAAGA 
ACACCACACA 
AGGTGGACGT 
TCCGTACTAC 



ATCTTATTAT 50 

CAAAGTTGAC 100 

AAGTTCGTGA 150 

GTAATCGCTG 2 00 

ACAAAAAATC 2 50 

CAGAACGTGA 3 00 

TCAATCACTG 3 50 

AATCAAAGTT 4 00 

AAACAACTGT 4 50 

GAAGCTGGTG 500 

CGTACAACGT 550 

CAAAATTCAA 600 

CACACTCCAT 650 

TGACGTAAC 699 



35 



2 ) INFORMATION FOR SEQ ID NO: 186 

(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



50 



55 



60 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 2 9970 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186 



CGGCGGTATC TTAGTAGTAT 
GTGAACACAT TCTTTTATCA 
TTCTTAAATA AAGTTGACAT 
TGAAATGGAA GTACGTGACT 
ATGTACCTGT AATCGCTGGT 
CAATACGAAG AAAAAATCTT 
TCCAACTCCA GAACGTGATT 
ACGTATTCTC AATCACTGGT 



CTGCTGCTGA CGGTCCAATG 
CGTAACGTTG GTGTACCAGC 
GGTTGACGAT GAAGAATTAT 
TATTATCTGA ATACGACTTC 
TCAGCATTAA AAGCTTTAGA 
AGAATTAATG CAAGCAGTTG 
CTGACAAACC ATTCATGATG 
CGTGGTACTG TTGCTACAGG 
112 



CCACAAACTC 50 

ATTAGTAGTA 100 

TAGAATTAGT 150 

CCAGGTGACG 200 

AGGCGATGCT 250 

ATGACTACAT 300 

CCAGTTGAGG 350 

CCGTGTTGAA 4 00 
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CGTGGGCAAA TCAAAGTTGG TGAAGAAGTT 
CACTTCTAAA ACAACTGTTA CTGGTGTAGA 
ACTACGCTGA AGCTGGTGAC AACATCGGTG 
CGTGAAGACG TACAACGTGG TCAAGTATTA 
ACCTCACACA AAATTTAAAG CAGACGTATA 
GTGGACGTCA CACTCCATTC TTCAC AAAC T 
CGTACTACTG ACGTAACTGG TGTTGTTAAC 
GGTTATGCCT GGCGACAACG TTGAAATGAC 
TCGCGATTGA AGACGGTACT CGTTTCTCA 



GAAATCATTG tZVATCZXT&W 4^0 

AATGTTCCGT AAATTATTAG 500 

CATTATTACG TGGTGTTGCT 550 

GCTGCTCCAG GTTCAATCAC 600 

CGTTTTATCT AAAGACGAAG 650 

ATCGTCCACA ATTCTATTTC 7 00 

TTACCAGAAG GTACTGAAAT 750 

AGTAGAATTA ATCGCTCCTA 800 

829 



2) INFORMATION FOR SEQ ID NO: 187 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 705 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: CSG 123 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187 

CACAAACTCG TGAACACATT CTTTTATCAC GTAACGTTGG TGTACCAGCT 5 0 

TTAGTTGTAT TCTTAAACAA AGTTGATATG GTAGACGACG AAGAATTATT 100 

AGAATTAGTA GAAATGGAAG TTCGTGACTT ATTATCTGAA TATGACTTCC 150 

CAGGTGACGA CGTACCTGTA ATCGCTGGTT CAGCATTAAA AGCTTTAGAA 2 00 

GGCGACGAAA AATACGAAGA AAAAATCTTA GAATTAATGC AAGCAGTTGA 250 

TGACTACATT CCAACTCCAG AACGTGATTC TGACAAACCA TTCATGATGC 30 0 

CAGTTGAGGA CGTATTCTCA ATCACTGGTC GTGGTACTGT TGCTACAGGC 350 

CGTGTTGAAC GTGGTCAAAT CAAAGTTGGT GAAGAAGTTG AAATCATCGG 40 0 

TTTACATGAC ACTTCTAAAA CAACTGTTAC TGGTGTAGAA ATGTTCCGTA 450 

AGTTATTAGA CTACGCTGAA GCTGGTGACA ACATCGGTGC TTTATTACGT 500 

GGTGTTGCTC GTGAAGACGT ACAACGTGGT CAAGTATTAG CTGCTCCTGG 550 

TTCAATTACA CCACATACAA AATTCAAAGC GGAAGTTTAC GTTTTATCTA 600 

AAGACGAAGG TGGACGTCAC ACTCCATTCT TCAGTAACTA CCGCCCACAA 650 

TTCTATTTCC GTACTACTGA CGTAACTGGC GTTGTTCAAT TACCAGAAGG 700 

TACTG 705 



2) INFORMATION FOR SEQ ID NO: 188 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 678 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: CSG 23 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188 



TTTTATCACG 
GTTGACATGG 
5 ACGTGACTTA 
TCGCTGGTTC 
AAAATCTTAG 
ACGTGACTCT 
TCACTGGTCG 

L0 AAAGTTGGTG 
AACTGTTACT 
CTGGTGACAA 
CAACGTGGTC 
ATTCAAAGCA 

15 CTCCATTCTT 
GTAACTGGTG 



TAACGTTGGT 
TTGACGATGA 
TTATCTGAAT 
AGCTTTAAAA 
AATTAATGCA 
GATAAACCAT 
TGGTACTGTT 
AAGAAGTTGA 
GGTGTAGAAA 
CATCGGTGCT 
AAGTATTAGC 
GACGTATACG 
CACTAACTAT 
TTGTTAACTT 



GTACCAGCAT 
AGAATTATTA 
ACGACTTCCC 
GCTTTAGAAG 
AGCAGTTGAT 
TCATGATGCC 
GCTACAGGTC 
AATTATTGGT 
TGTTCCGTAA 
TTATTACGTG 
TGCTCCAGGT 
TTTTATCAAA 
CGTCCACAAT 
ACCAGAAG 



TAGTAGTATT 
GAATTAGTTG 
AGGTGACGAC 
GCGATGCTCA 
GATTACATTC 
AGTTGAGGAC 
GTGTTGAACG 
AT C AAAG AAA 
ATTATTAGAC 
GTGTTGCTCG 
TCAATTACAC 
AGATGAAGGT 
TCTATTTCCG 



CTTAAACAAA 50 

AAATGGAAGT 100 

GTACCTGTAA 150 

ATACGAAGAA 200 

CAACTCCAGA 250 

GTATTCTCAA 300 

TGGTCAAATC 3 50 

CTTCTAAAAC 4 00 

TACGCTGAAG 450 

TGAAGATGTA 500 

CTCACACAAA 550 

GGACGTCATA 600 

TACTACTGAC 650 
678 



20 2) INFORMATION FOR SEQ ID NO: 189 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: CSG 33 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189 



35 



ACCAGCATTA GTAGTATTCT TAAATAAAGT TGACATGGTT GACGATGAAG 50 

AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGATGT ACCTGTAATC GCTGGTTCAG CATTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 200 

10 CAGTTGATGA CTACATTCCA ACTCCAGAAC GTGATTCTGA CAAACCATTC 250 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 3 00 

TACAGGCCGT GTTGAACGTG GTCAAATCAA AGTTGGTGAA GAAGTTGAAA 350 

TCATTGGTAT CCATGACACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 4 00 

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCATT 4 50 

45 ATTACGTGGT GTTGCTCGTG AAGACGTACA ACGTGGTCAA GTATTAGCTG 500 

CTCCAGGTTC AATCACACCT CACACAAAAT TTAAAGCAGA CGTATACGTT 550 

TTATCTAAAG ACGAAGGTGG ACGTCACACT CCATTCTTCA CAAACTATCG 600 

TCCACAATTC TATTTCCGTA CTACTGACGT AAC TGGTGTT GTTAACTTAC 650 

CAGAAGGTAC TGAAATGG 668 

50 

2) INFORMATION FOR SEQ ID NO: 190 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
5 (B) STRAIN: CSG 8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190 

AAAGTTGACA TGGTTGACGA TGAAGAATTA TTAGAATTAG TTGAAATGGA 50 

10 AGTACGTGAC TTATTATCTG AATACGACTT CCCAGGTGAC GATGTACCTG 100 

TAATCGCTGG TTCAGCATTA AAAGCTTTAG AAGGCGATGC TCAATACGAA 150 

GAAAAAATCT TAGAATTAAT GCAAGCAGTT GATGATTACA TTCCAACTCC 200 

AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 250 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 300 

15 ATCAAAGTTG GTGAAGAAGT TGAAATCATT GGTATCCATG ACACTTCTAA 350 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 400 

AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTGC TCGTGAAGAC 450 

GTACAACGTG GTCAAGTATT AGCTGCTCCA GGTTCAATCA CACCTCACAC 500 

AAAATTTAAA GCAGACGTAT ACGTTTTATC TAAAGACGAA GGTGGACGTC 550 

2 0 ACACTCCATT CTTCACAAAC TATCGTCCAC AATTCTATTT CCG 593 



2) INFORMATION FOR SEQ ID NO: 191 



25 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Staphylococcus hominis subsp. howinis 

(B) STRAIN: ATCC 27844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191 

4 0 CGGCGCTATC TTAGTAGTAT CTGCTGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC ATTAGTAGTA 100 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTACGTGACT TATTATCTGA ATACGACTTC CCAGGTGACG 2 00 

ACGTACCTGT AATCGCTGGT TCAGCTTTAA AAGCTTTAGA AGGCGATGCT 2 50 

4 5 CAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGATTATAT 300 

TCCAACTCCA GAACGTGACT CTGATAAACC ATTCATGATG CCAGTTGAGG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATTATTG GTATCAAAGA 450 

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

50 ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGATG TACAACGTGG TCAAGTATTA GCTGCTCCAG GTTCAATTAC 600 

ACCTCACACA AAATTCAAAG CAGACGTATA CGTTTTATCA AAAGATGAAG 650 

GTGGACGTCA TACTCCATTC TTCTCTAACT ATCGTCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG TGTTGTTAAC TTACCAGAAG GTACTGAAAT 750 

55 GGTAATGCCT GGTGACAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 8 00 

TCGCGATTGA AGACGGTACT CGTTTCTC 82 8 



60 2) INFORMATION FOR SEQ ID NO: 192 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 0 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 35982 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192 



15 



ATGGTCCAAT GCCACAAACT CGTGAACACA TTCTTTTATC ACGTAACGTT 50 

GGTGT AC C AG CTTTAGTTGT ATTCTTAAAC AAAGTTGATA TGGTAGACGA 100 

CGAAGAATTA TTAGAATTAG TAGAAATGGA AGTTCGTGAC TTATTATCTG 150 

AATATGACTT CCCAGGTGAC GACGTACCTG TAATCGCTGG TTCAGCATTA 2 00 

2 0 AAAGCTTTAG AAGGCGACGA AAAATACGAA GAAAAAATCT TAGAATTAAT 250 

GCAAGCAGTT GATGACTACA TTCCAACTCC AGAACGTGAT TCTGACAAAC 3 00 

CATTCATGAT GCCAGTTGAG GACGTATTCT CAATCACTGG TCGTGGTACT 350 

GTTGCTACAG GCCGTGTTGA ACGTGGTCAA ATCAAAGTTG GTGAAGAAGT 4 00 

TGAAATCATC GGTTTACATG ACACTTCTAA AACAACTGTT ACTGGTGTAG 4 50 

2 5 AAATGTTCCG TAAGTTATTA GACTACGCTG AAGCTGGTGA CAACATCGGT 500 

GCTTTATTAC GTGGTGTTGC TCGTGAAGAC GTACAACGTG GTCAAGTATT 550 

AGCTGCTCCT GGTTCAATTA CACCACATAC AAAATTCAAA GCGGAAGTTT 600 

ACGTTTTATC TAAAGACGAA 62 0 

30 

2) INFORMATION FOR SEQ ID NO: 193 

(i) SEQUENCE CHARACTERISTICS: 

3 5 (A) LENGTH: 6 92 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 17 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193 



45 



CCAGCATTAG TAGTATTCTT AAACAAAGTT GACATGGTTG ACGATGAAGA 50 

ATTATTAGAA TTAGTTGAAA TGGAAGTACG TGACTTATTA TCTGAATACG 100 

50 ACTTCCCAGG TGACGACGTA CCTGTAATCG CTGGTTCAGC TTTAAAAGCT 150 

TTAGAAGGCG ATGCTCAATA CGAAGAAAAA ATCTTAGAAT TAATGCAAGC 2 00 

AGTTGATGAT TATATTCCAA CTCCAGAACG TGACTCTGAT AAACCATTCA 2 50 

TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT 300 

ACAGGCCGTG TTGAACGTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT 3 50 

55 TATTGGTATC AAAGAAACTT CTAAAACAAC TGTTACTGGT GTAGAAATGT 400 

TCCGTAAATT ATTAGACTAC GCTGAAGCTG GTGACAACAT CGGTGCTTTA 4 50 

TTACGTGGTG TTGCTCGTGA AGATGTACAA CGTGGTCAAG TATTAGCTGC 500 

TCCAGGTTCA ATTACACCTC ACACAAAATT CAAAGCAGAC GTATACGTTT 550 

TATCAAAAGA TGAAGGTGGA CGTCATACTC CATTCTTCTC TAACTATCGT 600 

60 CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650 
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AGAAGGTACT GAAATGGTAA TGCCTGGTGA CAACGTTGAA AT [ 



5 2) INFORMATION FOR SEQ ID NO: 194 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 684 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 36 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194 



CATTCTTTTA TCACGTAACG TTGGTGTACC AGCATTAGTA GTATTCTTAA 50 

ACAAAGTTGA CATGGTTGAC GATGAAGAAT TATTAGAATT AGTTGAAATG 100 

GAAGTACGTG ACTTATTATC TGAATACGAC TTCCCAGGTG ACGACGTACC 150 

TGTAATCGCT GGTTCAGCTT TAAAAGCTTT AGAAGGCGAT GCTCAATACG 2 00 

2 5 AAGAAAAAAT CTTAGAATTA ATGCAAGCAG TTGATGATTA TATTCCAACT 2 50 

CCAGAACGTG ACTCTGATAA AC CATTC ATG ATGCCAGTTG AGGACGTATT 3 00 

CTCAATCACT GGTCGTGGTA CTGTTGCTAC AGGCCGTGTT GAACGTGGTC 3 50 

AAATCAAAGT TGGTGAAGAA GTTGAAATTA TTGGTATCAA AGAAACTTCT 4 00 

AAAACAACTG TTACTGGTGT AGAAATGTTC CGTAAATTAT TAGACTACGC 4 50 

3 0 TGAAGCTGGT GACAACATCG GTGCTTTATT ACGTGGTGTT GCTCGTGAAG 500 

ATGTACAACG TGGTCAAGTA TTAGCTGCTC CAGGTTCAAT TACACCTCAC 550 

ACAAAATTCA AAGCAGACGT ATACGTTTTA TCAAAAGATG AAGGTGGACG 600 

TCATACTCCA TTCTTCTCTA ACTATCGTCC ACAATTCTAT TTCCGTACTA 650 

CTGACGTAAC TGGTGTTGTT AACTTACCAG AAGG 684 

35 



2 ) INFORMATION FOR SEQ ID NO: 195 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 
50 (B) STRAIN: CSG 6 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195 

ACCAGCATTA GTAGTATTCT TAAACAAAGT TGACATGGTT GACGATGAAG 50 

55 AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGACGT ACCTGTAATC GCTGGTTCAG CTTTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 2 00 

CAGTTGATGA TTACATTCCA ACTCCAGAAC GTGACTCTGA TAAACCATTC 2 50 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 3 00 

60 TACAGGCCGT GTTGAACGTG GTCAAATCAA AGTTGGTGAA GAAGTTGAAA 3 50 
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TTATTGGTAT CAAAGAAACT 
TTCCGTAAAT TATTAGACTA 
ATTACGTGGT GTTGCTCGTG 
CTCCAGGTTC AATTACACCT 
TTATCAAAAG ATGAAGGTGG 
TCCACAATTC TATTTCCGTA 
CAGAAGGTAC TGAAATGGTA 



TCTAAAACAA 
CGCTGAAGCT 
AAGATGTACA 
CACACAAAAT 
ACGTCATACT 
CTACTGACGT 
ATGCCTGGCG 



CTGTTACTGG TGTAGAAATG 
GGTGACAACA TCGGTGCTTT 
ACGTGGTCAA GTATTAGCTG 
TCAAAGCAGA CGTATACGTT 
CCATTCTTCA CTAACTATCG 
AACTGGTGTT GTTAACTTAC 
ACAAC 



4 00 
450 
500 
550 
600 
650 
685 



LO 



L5 



20 



25 



BO 



35 



10 



15 



50 



55 



2) INFORMATION FOR SEQ ID NO: 196 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 611 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 62 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196 

GACTTATTAT CTGAATACGA CTTCCCAGGT GACGACGTAC CTGTAATCGC 50 

TGGTTCAGCT TTAAAAGCTT TAGAAGGCGA TGCTCAATAC GAAGAAAAAA 100 

TCTTAGAATT AATGCAAGCA GTTGATGATT ACATTCCAAC TCCAGAACGT 150 

GACT CTGATA AACCATTCAT GATGC CAGTT GAGGACGTAT TCTCAATCAC 2 00 

TGGTCGTGGT ACTGTTGCTA CAGGCCGTGT TGAACGTGGT CAAATCAAAG 250 

TTGGTGAAGA AGTTGAAATT ATTGGTATCA AAGATACTTC TAAAACAACT 300 

GTTACTGGTG TAGAAATGTT CCGTAAATTA TTAGACTACG CTGAAGCTGG 350 

TGACAACATC GGTGCTTTAT TACGTGGTGT TGCTCGTGAA GATGTACAAC 4 00 

GTGGTCAAGT ATTAGCTGCT CCAGGTTCAA TCACACCTCA CACAAAATTC 450 

AAAGCAGACG TATATGTTTT ATCAAAAGAT GAAGGTGGAC GTCATACTCC 500 

ATTCTTCACT AACTATCGTC CACAATTCTA TTTCCGTACT ACTGACGTAA 550 

CTGGTGTTGT TAACTTACCA GAAGGTACTG AAATGGTAAT GCCTGGCGAC 6 00 

AACGTTGAAA T 611 



2 ) INFORMATION FOR SEQ ID NO: 197 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensis 

(B) STRAIN: ATCC 43809 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197 

CGGCGGTATC TTAGTAGTTT CTGCTGCAGA TGGTCCAATG CCACAAACTC 50 
GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTGCCAGC ATTAGTAGTA 100 
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10 



15 



TTCTTAAACA 
AGAAATGGAA 
ATGTGCCTGT 
AAATACGAAG 
TCCAACTCCA 
ACGTATTCTC 
CGTGGTCAAA 
TACTACTAAA 
ACTACGCTGA 
CGTGAAGATG 
ACCTCACACT 
GTGGACGTCA 
CGTACTACAG 
GGTTATGCCT 
TCGCTATCGA 



AAGTTGACAT 
GTTCGTGATT 
AATCGCTGGT 
CTAAAATCTT 
GAACGTGACT 
AATCACTGGT 
TCAAAGTCGG 
ACAACTGTTA 
AGCTGGTGAC 
TACAACGTGG 
AAATTTAAAG 
TACACCATTC 
ACGTAACTGG 
GGCGACAACG 
AGACGGAACT 



GGTTGACGAT 
TATTAACTGA 
TCAGCATTAA 
AGAATTAATG 
CTGACAAACC 
CGTGGTACTG 
TGAAGAAGTT 
CTGGTGTAGA 
AACATCGGTG 
ACAAGTATTA 
CTGACGTATA 
TTCTCAAACT 
TGTTGTTAAC 
TTGAAATGAC 
CGTTTCTC 



GAAGAATTAT 
ATATGACTTC 
AAGCTTTAGA 
GATGCAGTTG 
ATTCATGATG 
TTGCTACAGG 
GAAATTATTG 
AATGTTCCGT 
CGTTATTACG 
GCTGCTCCAG 
TGTTTTATCT 
ACCGCCCACA 
TTACCAGAAG 
AGTTGAATTA 



TAGAATTAGT 150 

CCAGGTGACG 2 00 

AGGCGACGAA 250 

ATAACTACAT 3 00 

CCAGTTGAGG 350 

CCGTGTTGAA 4 00 

GTATCCACGA 450 

AAATTATTAG 500 

TGGTGTTGCT 550 

GTTCAATTAC 60 0 

AAAGATGAAG 650 

ATTCTATTTC 700 

GTACAGAAAT 750 

ATCGCTCCAA 800 
828 



2) INFORMATION FOR SEQ ID NO: 198 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 690 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 35552 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198 

35 AGTAGTATCT GCTGCTGATG GCCCAATGCC ACAAACTCGT GAAC AC ATT C 50 

TTTTAT CACG TAACGTTGGT GTTCCAGCAT TAGTTGTATT CTTAAACAAA 100 

GTTGACATGG TTGACGATGA AGAATTATTA GAATTAGTAG AAATGGAAGT 150 

TCGTGACTTA TTAAGCGAAT ATGACTTCCC AGGTGACGAT GTACCTGTAA 2 00 

TCTCTGGTTC TGCATTAAAA GCTTTAGAAG GCGACGCTGA CTATGAGCAA 250 

4 0 AAAATCTTAG ACTTAATGCA AGCTGTTGAT GACTTCATTC CAACACCAGA 3 00 

ACGTGATTCT GACAAACCAT TCATGATGCC AGTTGAGGAC GTATTCTCAA 3 50 

TCACTGGTCG TGGTACTGTT GCTACAGGCC GTGTTGAACG TGGTCAAATC 4 00 

AAAGTCGGTG AAGAAATCGA AATCATCGGT ATGCAAGAAG AATCAAGCAA 450 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 500 

45 AAGCTGGTGA CAACATTGGT GCATTATTAC GTGGTGTTTC ACGTGATGAC 550 

GTACAACGTG GTCAAGTTTT AGCTGCTCCT GGTACTATTA CACCACATAC 600 

AAAATTCAAA GCGGATGTTT ACGTTTTATC TAAAGATGAA GGTGGTCGTC 650 

ATACACCATT CTTCACTAAC TACCGCCCAC AATTCTATTT 690 



50 



2) INFORMATION FOR SEQ ID NO: 199 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 723 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

119 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus sapropfrytlcus 

(B) STRAIN: CSG 83 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199 

GCATTAGTTG TATTCTTAAA CAAAGTTGAC ATGGTTGACG ATGAAGAATT 50 

ATTAGAATTA GTAGAAATGG AAGTTCGTGA TTTATTAAGC GAATATGACT 100 

10 TCCCAGGTGA CGATGTACCT GTAATCTCTG GTTCTGCATT AAAAGCTTTA 150 

GAAGGCGACG CTGACTATGA GCAAAAAATC TTAGACTTAA TGCAAGCTGT 2 00 

TGATGACTTC ATTCCAACAC CAGAACGTGA TTCTGACAAA CCATTCATGA 2 50 

TGCCAGTTGA GGACGTATTC TCAATCACTG GTCGTGGTAC TGTTGCTACA 300 

GGCCGTGTTG AACGTGGTCA AATCAAAGTC GGTGAAGAAA TCGAAATCAT 350 

15 CGGTATGCAA GAAGAATCAA GCAAAACAAC TGTTACTGGT GTAGAAATGT 4 00 

TCCGTAAATT ATTAGACTAC GCTGAAGCTG GTGACAACAT TGGTGCATTA 450 

TTACGTGGTG TTTCACGTGA TGACGTACAA CGTGGTCAAG TTTTAGCTGC 500 

TCCTGGTACT ATTACACCAC ATACAAAATT CAAAGCGGAT GTTTACGTTT 550 

TATCTAAAGA TGAAGGTGGT CGTCATACAC CATTCTTCAC TAACTACCGC 600 

20 CCACAATTCT ATTTCCGTAC TACTGACGTA ACTGGTGTTG TTAACTTACC 650 

AGAAGGTACT GAAATGGTTA TGCCTGGCGA TAACGTTGAA ATGGATGTTG 7 00 

AATTAATTTC TCCAATCGCT ATT 72 3 



25 



35 



60 



2 ) INFORMATION FOR SEQ ID NO: 2 00 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 697 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSsa 18 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200 

CGTTGGTGTT CCAGCATTAG TTGTATTCTT AAACAAAGTT GACATGGTTG 50 

ACGATGAAGA ATTATTAGAA TTAGTAGAAA TGGAAGTTCG TGACTTATTA 100 

AGCGAATAT G ACTTCCCAGG TGACGATGTA CCTGTAATCT CTGGTTCTGC 150 

4 5 ATTAAAAGCT TTAGAAGGCG ACGCTGACTA TGAG CAAAAA ATCTTAGACT 200 

TAATGCAAGC TGTTGATGAC TTCATTCCAA CACCAGAACG TGATTCTGAC 2 50 

AAACCATTCA TGATGCCAGT TGAGGACGTA TTCTCAATCA CTGGTCGTGG 300 

TACTGTTGCT ACAGGCCGTG TTGAACGTGG TCAAATCAAA GTCGGTGAAG 3 50 

AAATCGAAAT CATCGGTATG CAAGAAGAAT CAAGCAAAAC AACTGTTACT 4 00 

50 GGTGTAGAAA TGTTCCGTAA ATTATTAGAC TACGCTGAAG CTGGTGACAA 4 50 

CATTGGTGCA TTATTACGTG GTGTTTCACG TGATGACGTA CAACGTGGTC 500 

AAGTTTTAGC TGCTCCTGGT ACTATTACAC CACATACAAA ATTCAAAGCG 550 

GATGTTTACG TTTTATCTAA AGATGAAGGT GGTCGTCATA CACCATTCTT 600 

CACTAACTAC CGCCCACAAT TCTATTTCCG TACTACTGAC GTAACTGGTG 650 

55 TTGTTAACTT ACCAGAAGGT ACTGAAATGG TTATGCCTGG CGATAAC 697 



2) INFORMATION FOR SEQ ID NO: 2 01 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Staphylococcus sciuri subsp. sciuri 

(B) STRAIN: ATCC 29060 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 201 

15 CGGCGGTATC TTAGTAGTAT CTGCTGCTGA CGGTCCAATG CCTCAAACTC 50 

GTGAGCACAT TCTTTTATCA CGTAACGTAG GTGTTCCTGC ATTAGTAGTA 100 

TTCTTAAACA AAGTTGACAT GGTTGACGAT GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGCGACG 2 00 

ACGTTCCTGT AATTGCTGGT TCAGCATTAA AAGCATTAGA AGGCGACGAA 2 50 

20 GCTTACGAAG ACAAAATCAT GGAATTAATG GATGCTGTTG ATACATTCAT 3 00 

CCCAACTCCA GAACGTGACT CTGACAAACC ATTCATGATG CCAGTTGAGG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCACTGTTGG TGAAGAAGTT GAAATCATCG GTTTAACTGA 4 50 

AGAATCTTCT AAAACAACTG TAACTGGTGT TGAAATGTTC CGTAAATTAT 500 

25 TAGACTTCGC TGAAGCTGGA GATAACATCG GTGCATTATT ACGTGGTGTT 550 

GCTCGTGAAG ACGTTAACCG TGGTCAAGTA TTAGCTAAAC CAGGTTCAAT 6 00 

CACACCTCAC ACTAAATTCA AAGCTGAAGT TTATGTATTA TCTAAAGACG 650 

AAGGTGGACG TCATACTCCA TTCTTCACAA ACTACCGCCC ACAATTCTAT 700 

TTCCGTACTA CTGACGTAAC TGGTGTAGTT AACTTACCAG AAGGTACTGA 750 

3 0 AATGGTTATG CCTGGCGACA ACGTTGAAAT GGACGTTGAA TTAATTTCAC 8 00 

CAATCGCTAT TGAAGACGGT ATCGTTTCTC AATCA 83 5 



35 2) INFORMATION FOR SEQ ID NO: 202 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202 



50 



CGGCGGTATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGTTGATAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGTGACG 2 00 

55 ACGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 2 50 

AAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 3 00 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 450 

60 CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAGTTATTAG 500 
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ACTACGCTGA AGCTGGTGAC 
CGTGAAGACG TACAACGTGG 
ACCACATACA AAATTCAAAG 
GTGGACGTCA CACTCCATTC 
CGTACTACTG ACGTAACTGG 
GGTTATGCCT GGTGATAACG 
TCGCGATTGA AGACGGTACT 



AACATCGGTG 
TCAAGTATTA 
CGGAAGTTTA 
TTCAGTAACT 
CGTTGTTCAA 
TTGAAATGAC 
CGTTTCTCAA 



CTTTATTACG TGGTU TT GCT" 
GCTGCTCCTG GTTCAATTAC 
CGTTTTATCT AAAGACGAAG 
ACCGCCCACA ATTCTATTTC 
TTACCAGAAG GTACTGAAAT 
AGTAGAATTA ATCGCTCCTA 
C 



5%* 



600 
650 
700 
750 
800 
831 



2) INFORMATION FOR SEQ ID NO: 2 03 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: CSG 50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203 

CGGCGGTATC TTAGTTGTAT CTGCAGCTGA TGGTCCAATG CCACAAACTC 50 

GTGAACACAT TCTTTTATCA CGTAACGTTG GTGTACCAGC TTTAGTTGTA 100 

TTCTTAAACA AAGTTGATAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

AGAAATGGAA GTTCGTGACT TATTATCTGA ATATGACTTC CCAGGTGACG 2 00 

ACGTACCTGT AATCGCTGGT TCAGCATTAA AAGCTTTAGA AGGCGACGAA 2 50 

AAATACGAAG AAAAAATCTT AGAATTAATG CAAGCAGTTG ATGACTACAT 3 00 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTTTACATGA 4 50 

CACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAGTTATTAG 500 

ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCT 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCAATTAC 6 00 

ACCACATACA AAATTCAAAG CGGAAGTTTA CGTTTTATCT AAAGACGAAG 650 

GTGGACGTCA CACTCCATTC TTCAGTAACT ACCGCCCACA ATTCTATTTC 700 

CGTACTACTG ACGTAACTGG CGTTGTTCAA TTACCAGAAG GTACTGAAAT 750 

GGTTATGCCT GGTGATAACG TTGAAATGAC AGTAGAATTA ATCGCTCCTA 8 00 

TCGCGATTGA AGACGGTACT CGTTTCTCA 82 9 



2) INFORMATION FOR SEQ ID NO: 204 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium longum 

(B) STRAIN: ATCC 15707 
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TGGCGCTATC CTCGTTGTGG CCGCCACCGA CGGCCCGATG GCCCAGACTC 50 

GCGAGCACGT GCTGCTCGCC CGTCAGGTTG GCGTTCCGAA GATCCTCGTC 100 

5 GCCCTGAACA AGTGCGACAT GGTCGACGAT GAAGAGCTCA TCGAGCTCGT 150 

CGAAGAAGAG GTCCGCGACC TCCTCGACGA GAACGGCTTC GACCGTGACT 20 0 

GCCCGGTCAT CCACACCTCC GCTTACGGTG CTCTGCACGA CGACGCTCCG 2 50 

GACCACGAGA AGTGGGTCCA GTCCGTTAAG GACCTCATGG ACGCTGTCGA 300 

CGACTACATC CCGACCCCGG TTCACGACCT GGACAAGCCG TTCCTGATGC 3 50 

L0 CGATCGAGGA CGTCTTCACC ATCTCCGGCC GTGGTACCGT TGTCACCGGT 400 

CGTGTCGAGC GTGGCCAGCT GGCCGTCAAC ACCCCGGTCG AGATCGTTGG 45 0 

TATCCGTCCG ACCCAGCAGA CCACCGTCAC CTCCATCGAG ACCTTCCACA 500 

AGACCATGGA CGCCTGCGAG GCTGGCGACA ACACCGGTCT GCTTCTGCGT 550 

GGTCTCGGCC GTGACGATGT CGAGCGTGGC CAGGTTGTGG CCAAGCCGGG 600 

L5 CTCCGTCACC CCGCACACCA AGTTCGAGGG CGAAGTCTAC GTGCTGACCA 650 

AGGACGAAGG CGGCCGTCAC TCGCCGTTCT TCTCCAACTA CCGTCCGCAG 700 

TTCTACTTCC GCACCACCGA CGTCACCGGC GTCATCGAGC TGCCGGAAGG 75 0 

CGTCGAGATG GTTCAGCCGG GCGACCACGC TACCTTCACC GTTGAGCTGA 8 00 

TTCAGCCCAT CGCTATGGAG GAAGGCCTGA CCTTCGCTG 83 9 

20 



2) INFORMATION FOR SEQ ID NO: 205 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stenotrophomonas maltophilia 
35 (B) STRAIN: CDC F3338 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205 

CGGCGCGATC CTGGTGTGCT CGGCCGCTGA CGGCCCGATG CCGCAGACCC 50 

10 GTGAGCACAT CCTGCTGTCG CGCCAGGTCG GCGTGCCGTA CATCGTCGTG 100 

TTCCTGAACA AGGCCGACAT GGTCGACGAC GCCGAGCTGC TCGAGCTGGT 150 

CGAGATGGAA GTGCGCGAAC TGCTGAGCAA GTACGAGTTC CCGGGCGACG 2 00 

ACACCCCGAT CATCGCCGGT TCGGCCCGCC TGGCGCTGGA AGGCGACCAG 2 50 

AGCGAC AT CG GCGTGCCGGC CAT C CTGAAG CTGGTCGACG CGCTGGACAG 3 00 

15 CTGGATTCCG GAGCCGGAGC GTGCGATCGA CAAGCCGTTC CTGATGCCGG 3 50 

TGGAAGACGT GTTCTCGATC TCGGGCCGCG GCACCGTGGT GACCGGTCGT 4 00 

ATCGAGCGCG GCGTGATCAA GGTTGGCGAC GAAATCGAAA TCGTCGGCAT 450 

CCGTCCGGTG CAGAAGACCA CCGTGACCGG CGTTGAAATG TTCCGCAAGC 500 

TGCTGGACCA GGGTCAGGCA GGCGACAACG CTGGCCTGCT GCTGCGCGGC 550 

50 ACCAAGCGTG ATGACGTCGA GCGTGGCCAG GTGCTGGCCA AGCCGGGCAC 600 

GATCAAGCCG CACACCAAGT TCGAAGGCGA AGTGTACGTC CTGTCGAAGG 650 

ACGAGGGCGG CCGCCACACC CCGTTCTTCA ACGGCTACCG TCCGCAGTTC 700 

TACTTCCGCA CCACCGACAT CACCGGCGCC GCTGCACTGC CGGAAGGCGT 750 

CGAA 754 

55 

2) INFORMATION FOR SEQ ID NO: 206 
50 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 83 5 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidominimus 
LO (B) STRAIN: ATCC 51726 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206 

TGGTGCTATC CTTGTAGTAG CTTCAACTGA CGGACCAATG CCACAAACTC 50 

L5 GTGAGCACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAAAA CCTTATCGTT 100 

TTCATGAACA AAGTTGACCT TGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TTCTTTCAGA ATACGATTTC CCAGGTGATG 2 00 

ATCTTCCAGT TGTTCAAGGT TCAGCTCTTA AAGCGCTTGA AGGTGATTCA 2 50 

GCACAAGAAG ATGTTATCAT GGAATTGATG TCAATCGTTG ACACATACAT 300 

2 0 TCCAGAACCA GAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAGG 3 50 

ATGTATTCTC AATCACTGGA CGTGGTACTG TTGCTTCAGG ACGTATCGAC 4 00 

CGTGGTACTG TTAAAGTTAA TGACGAAGTT GAAATCGTTG GTATCAAAGA 450 

CGAAATCTCT AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

TTGACGAAGG TCTTGCTGGA GATAACGTTG GTGTTCTTCT TCGTGGTGTA 550 

25 CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGC TAAAC CAGGTTCAAT 600 

CAACCCACAC ACTAAATTCA AAGGTGAAGT TTACGTTCTT TCTAAAGAAG 650 

AAGGTGGACG TCACACTCCA TTCTTCGATA ACTACCGTCC TCAGTTCTAC 70 0 

TTCCGTACAA CTGACGTAAC TGGTTCAATC AAATTGCCAG AAGGTACTGA 750 

AATGGTAATG CCTGGTGATA ACGTAACTAT CGAAGTTGAG TTGATCCACC 800 

3 0 CAATCGCCGT TGAACAAGGT ACTACTTTCT CTATC 83 5 



2) INFORMATION FOR SEQ ID NO : 2 07 



35 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
*0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
£5 (A) ORGANISM: Streptococcus agalactia^ 

(B) STRAIN: ATCC 124 03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207 

50 CTATCCTTGT AGTTGCTTCA ACTGATGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA AGTTGGTGTT AAACACCTTA TCGTATTCAT 100 

GAACAAAGTT GACCTTGTTG ATGATGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATTCG TGACCTTCTT TCAGAATACG ACTTCCCAGG TGATGACCTT 2 00 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ACGAAAAATA 250 

55 CGAAGACATC ATCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG 300 

AACCAGAACG TGATACTGAC AAACCTTTAC TTCTTCCAGT TGAAGATGTA 3 50 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 400 

TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT AAAGAAGATA 4 50 

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

50 GAAGGTCTTG CAGGGGACAA CGTTGGTGTT CTTCTTCGTG GTGTTCAACG 550 
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TGATGAAATC GAACGTGGTC AAGTTCTTGC 
CACACACTAA ATTTAAAGGT GAAGTTTACA 
GGACGTCATA CTCCATTCTT CAACAACTAC 
TACAACTGAC GTAACAGGTT CAATCGAACT 
5 TTATGCCTGG TGATAACGTT ACTATCGAAG 
GCCGTAGAAC AAGGTACTA 



TAAACCAGGT PrtMW ttXtSF 1 00 

TCCTTTCTAA AGAAGAAGGT 650 
CGTCCACAAT TCTACTTCCG 700 
TCCAGCAGGA ACAGAAATGG 750 
TTGAATTGAT TCACCCAATC 800 

819 



LO 2) INFORMATION FOR SEQ ID NO: 208 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
L5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 12973 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208 



CTATCCTTGT AGTTGCTTCA AC TGATGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA AGTTGGTGTT AAACACCTTA TCGTATTCAT 100 

GAACAAAGTT GACCTTGTTG ATGATGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATTCG TGACCTTCTT TCAGAATACG ACTTCCCAGG TGATGACCTT 200 

3 0 CCAGTTATCC AAGGTTCAGC TCTTAAAGCA CTTGAAGGCG ATGAAAAATA 250 

CGAAGAC AT C ATCATGGAAT TGATGAGCAC TGTTGATGAG TACATTCCAG 300 

AACCAGAACG TGATACTGAC AAACCTTTAC TTCTTCCAGT CGAAGATGTA 350 

TTCTCAATCA CTGGACGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 4 00 

TACTGTTCGT GTCAACGACG AAGTTGAAAT CGTTGGTATT AAAGAAGATA 450 

35 TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

GAAGGTCTTG CAGGGGACAA CGTTGGTGTT CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGTC AAGTTCTTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT 650 

GGACGTCATA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

iO TACAACTGAC GTAACAGGTT CAATCGAACT TCCAGCAGGA ACAGAAATGG 750 

TTATGCCTGG TGATAACGTT ACTATCGAAG TTGAATTGAT TCACCCAATC 800 

GCCGTAGAAC AAGGTACTA 819 



45 



55 



2) INFORMATION FOR SEQ ID NO : 2 09 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 822 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 13813 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209 
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AGCTATCCTT GTAGTTGCTT CAACTGATGG ACCAATGCCA CAAACTCGTG 50 

AGCACATCCT TCTTTCACGT CAAGTTGGTG TTAAACACCT TATCGTATTC 100 

ATGAACAAAG TTGACCTTGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

5 AATGGAAATT CGTGACCTTC TTTCAGAATA CGACTTCCCA GGTGATGACC 2 00 

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CACTTGAAGG CGATGAAAAA ,2 50 

TACGAAGACA TCATCATGGA ATTGATGAGC ACTGTTGATG AGTACATTCC 3 00 

AGAACCAGAA CGTGATACTG ACAAACCTTT ACTTCTTCCA GTCGAAGATG 350 

TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 4 00 

LO GGTACTGTTC GTGTCAACGA CGAAGTTGAA ATCGTTGGTA TTAAAGAAGA 4 50 

TATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 500 

ACGAAGGTCT TGCAGGGGAC AACGTTGGTG TTCTTCTTCG TGGTGTTCAA 550 

CGTGATGAAA TCGAACGTGG TCAAGTTCTT GCTAAACCAG GTTCAATCAA 600 

CCCACACACT AAATTTAAAG GTGAAGTTTA CATCCTTTCT AAAGAAGAAG 6 50 

L5 GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA ATTCTACTTC 7 00 

CGTACAACTG ACGTAACAGG TTCAATCGAA CTTCCAGCAG GAACAGAAAT 7 50 

GGTTATGCCT GGTGATAACG TTACTATCGA AGTTGAATTG ATTCACCCAA 800 

TCGCCGTAGA ACAAGGTACT AC 822 



>0 



B5 



2 ) INFORMATION FOR SEQ ID NO: 210 



( i ) SEQUENCE CHARACTERISTICS : 
IS (A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

3 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactia^ 

(B) STRAIN: CDCss-1073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210 



CGGAGCTATC CTTGTAGTTG CTTCAACTGA TGGACCAATG CCACAAACTC 50 

GTGAGCACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAACA CCTTATCGTA 100 

10 TTCATGAACA AAGTTGACCT TGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATTCGTGACC TTCTTTCAGA ATACGACTTC CCAGGTGATG 2 00 

ACCTTCCAGT TATCCAAGGT TCAGCTCTTA AAGCACTTGA AGGCGACGAA 250 

AAATACGAAG ACATCATCAT GGAATTGATG AGCACTGTTG ATGAGTACAT 3 00 

TCCAGAACCA GAACGTGATA CTGACAAACC TTTACTTCTT CCAGTTGAAG 3 50 

15 ATGTATTCTC AATCACTGGA CGTGGTACAG TTGCTTCAGG ACGTATCGAC 400 

CGTGGTACTG TTCGTGTCAA CGACGAAGTT GAAATCGTTG GTATTAAAGA 4 50 

AGATATCCAA AAAGCAGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

TTGACGAAGG TCTTGCAGGG GACAACGTTG GTGTTCTTCT TCGTGGTGTT 550 

CAACGTGATG AAATCGAACG TGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 6 00 

50 CAACCCACAC ACTAAATTTA AAGGTGAAGT TTACATCCTT TCTAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCAACA ACTACCGTCC ACAATTCTAC 7 00 

TTCCGTACAA CTGACGTAAC AGGTTCAATC GAACTTCCAG CAGGAACAGA 750 

AATGGTTATG CCTGGTGATA ACGTTACTAT CGAAGTTGAA TTGATTCACC 8 00 

CAATCGCCGT AGAACAAGGT ACTAC 825 

55 

2) INFORMATION FOR SEQ ID NO: 211 
60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 826 bases 
<B) TYPE: Nucleic acid 
<C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 
10 (B) STRAIN: ATCC 33397 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211 

GGAGCTATCC TTGTAGTAGC TTCAACTGAC GGACCAATGC CTCAAACTCG 50 

15 TGAACACATC CTTCTTTCAC GCCAAGTAGG TGTTAAATAC CTTATCGTCT 100 

TCATGAATAA AGTTGACTTG GTTGACGATG AAGAATTGCT TGAATTGGTT 150 

GAAATGGAAA TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA 2 00 

AATCCCAGTT ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATGAAA 250 

AATATGAAGA CATCATCATG GAATTGATGG ATACTGTTGA TGAATACATT 300 

20 CCAGAACCAG AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTTGAAGA 350 

TGTATTCTCA ATTACTGGAC GTGGTACTGT TGCTTCAGGA CGTATCGACC 400 

GTGGTACTGT TAAAGTCAAC GACGAAGTTG AAATCGTTGG TATCCGTGAT 4 50 

GAAATCCAAA AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAATT 500 

GGACGAAGGT CTTGCTGGAG ATAACGTAGG GGTTCTTCTT CGTGGTATCC 550 

25 AACGTGACGA AATCGAACGT GGACAAGTTC TTGCTAAACC AGGTTCAATT 600 

CATCCACACA CTAAATTCAA AGGTGAAGTT TACATCCTTA CTAAAGAAGA 650 

AGGTGGACGT CATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTACT 700 

TCCGTACTAC AGACGTTACA GGTTCAATCG AACTTCCTGC AGGTACTGAA 750 

ATGGTAATGC CTGGTGATAA CGTAACAATC GACGTTGAAT TGATCCACCC 800 

3 0 AATTGCCGTA GAACAAGGAA CTACAT 82 6 



35 



2) INFORMATION FOR SEQ ID NO: 212 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Streptococcus bovis 

(B) STRAIN: ATCC 33317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212 

50 TGGTGCTATC CTTGTAGTAG CTTCTACAGA TGGTCCAATG CCACAAACAC 50 

GTGAACACAT CCTTCTTTCA CGTCAAGTTG GTGTTAAACA CCTTATCGTC 100 

TTCATGAACA AAGTTGACCT TGTTGATGAC GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGACC TTCTTTCAGA ATATGATTTC CCAGGTGATG 2 00 

AAATCCCTGT AATCCAAGGT TCAGCTCTTA AAGCCCTTGA AGGTGACACT 2 50 

55 CACTACGAAG ACATCATCAT GGAATTGATG AACACTGTAG ATGAATACAT 3 00 

TCCAGAACCA AAACGTGATA CTGACAAACC ATTGCTTCTT CCAGTCGAAG 3 50 

ACGTATTCTC AATCACTGGT CGTGGTACTG TAGCATCAGG ACGTATCGAC 4 00 

CGTGGTACTG TTAAAGTCAA CGACGAAGTT GAAATCGTTG GTATCCGTGA 4 50 

CGACATCCAA AAAGCTGTTG TTACTGGTGT TGAAATGTTC CGTAAACAAC 500 

50 TTGATGAAGG TATCGCAGGG GATAACGTTG GTGTTCTTCT TCGTGGTATC 550 

127 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CAACGTGATG AAATCGAACG 
CCACCCACAC ACTAAATTCA 
AAGGTGGACG TCACACTCCA 
TTCCGTACAA CTGACGTTAC 
5 AATGGTAATG CCTGGTGATA 
CAATCGCCGT TGAACAAGGT 



TGGTCAAGTT CTTGCTAAAC 
AAGGTGAAGT TTACATCCTT 
TTCTTCAACA ACTACCGTCC 
AGGTTCAATC GAACTTCCAG 
ACGTTACTAT CGACGTTGAA 
ACTACAT 



CAGGTTCAAT 6 00 

ACTAAAGAAG 6 50 

TCAATTCTAC 7 00 

CAGGTACTGA 7 50 

TTGATTCACC 800 

827 



10 2 ) INFORMATION FOR SEQ ID NO: 213 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus (deposited as 
Streptococcus constellatus) 

(B) STRAIN: ATCC 27823 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213 

GCTATCCTCG TAGTAGCTTC AACTGATGGA CCAATGCCTC AAACTCGTGA 50 

ACATATCCTT CTTTCACGTC AAGTAGGTGT TAAATACCTT ATCGTCTTCA 100 

TGAACAAAGT TGACTTGGTT GACGATGAAG AATTGCTTGA ATTGGTTGAA 150 

30 ATGGAAATCC GTGACCTTCT TTCAGAATAC GATTTCCCAG GTGATGAAAT 2 00 

CCCAGTTATC CAAGGTTCAG CTCTTAAAGC TCTTGAAGGT GATGAAAAAT 2 50 

ATGAAGACAT CAT CATGGAA TTGATGGATA CTGTTGATGA ATACATTCCA 3 00 

GAACCAGAAC GTGACACTGA CAAACCACTT CTTCTTCCAG TCGAAGATGT 3 50 

ATTCTCAATC ACTGGACGTG GTACTGTTGC TTCAGGACGT ATCGAC CGTG 4 00 

3 5 GTACTGTTAA AGTCAATGAT GAAGTTGAAA TTGTTGGTAT TCGTGACGAA 4 50 

ATC CAAAAAG CAGTTGTTAC TGGTGTTGAA ATGTTCCGTA AACAATTGGA 500 

CGAAGGTCTT GCTGGAGATA ACGTAGGGGT TCTTCTTCGT GGTATCCAAC 5 50 

GTGACGAAAT CGAACGTGGA CAAGTTCTTG CTAAACCAGG TTCAATTCAT 600 

CCACACACTA AATTCAAAGG TGAAGTTTAC ATCCTTACTA AAGAAGAAGG 650 

40 TGGACGTCAT ACTCCATTCT TCAACAACTA CCGTCCTCAA TTCTACTTCC 7 00 

GTACTACAGA CGTTACAGGT TCAATCGAAC TTCCTGCAGG TACTGAAATG 750 

GTAATGCCTG GTGATAACGT AACAATTGAT GTTGAGTTGA TCCACCCAAT 8 00 

TGCCGTAGAA CAAGGAACTA C 821 



45 



2 ) INFORMATION FOR SEQ ID NO: 214 



( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus cricetus 

(B) STRAIN: ATCC 19642 

60 

128 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214 

GCTATCCTTG TAGTAGCTTC TACAGACGGA CCAATGCCAC AAACTCGTGA 50 

ACACATCTTG CTTTCACGCC AAGTTGGTGT TAAGAGCCTT ATCGTCTTCA 100 

5 TGAACAAGGT TGACTTGGTT GACGATGAAG AATTGCTTGA ATTGGTTGAA 150 

ATGGAAATCC GTGATCTTCT TTCAGAATAC GATTTCCCAG GTGATGATAT 2 00 

CCCTGTTGTT CAAGGTTCAG CTCTTAAAGC CCTTGAAGGT GATACAGCTG 2 50 

CCGAAGACAA GATCATGGAA TTGATGGACA TCGTTGATGA CTACATTCCA 3 00 

GAACCAAAAC GTGATACTGA TAAGCCATTG CTTCTTCCAG TCGAAGACGT 3 50 

10 ATTCTCAATC ACTGGACGTG GTACTGTTGC TTCAGGACGT ATCGACCGTG 4 00 

GTACTGTTAA GGTCAATGAC GAAGTTGAAA TCGTTGGTAT CAAGGACGAA 450 

ATC C AAAAAG CGGTTGTTAC CGGAGTTGAA ATGTTCCGTA AACAATTGGA 500 

TGAAGGTCTT GCAGGGGATA ACGTTGGTGT GCTTCTTCGT GGTATCCAAC 550 

GTGATGAAAT CGAACGTGGT CAAGTATTGG CTGCACCTGG TTCAATCCAT 600 

15 CCACACACTA AATTCAAGGG TGAAGTTTAC ATCCTTTCTA AAGATGAAGG 650 

TGGACGTCAC ACTCCATTCT TCAACAACTA CCGTCCACAG TTCTACTTCC 700 

GTACAACTGA CGTAACTGGT TCAATCGAAT TGCCAGCAGG TACTGAAATG 750 

GTTATGCCTG GTGATAACGT TACTATCGAC GTTGAATTGA TCCACCCAAT 800 

CGCTGTTGAA AAAGGTACTA C 821 

20 



2) INFORMATION FOR SEQ ID NO: 215 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus cristatus 
35 (B) STRAIN: ATCC 51100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 215 

TATCCTTGTA GTAGCTTCAA CTGACGGACC AATGC CACAA ACTCGTGAGC 50 

4 0 ACATCCTTCT TTCACGTCAG GTTGGTGTTA AACACCTTAT CGTCTTCATG 100 

AACAAGATCG ACTTGGTTGA TGACGAAGAA TTGCTTGAAT TGGTTGAAAT 150 

GGAAATCCGT GACCTCTTGT CAGAATACGA CTTCCCAGGT GACGATCTTC 2 00 

CAGTTATCCA AGGTTCAGCT CTTAAAGCTC TTGAAGGTGA TACTAAGTAC 250 

GAAGACATCA TCATGGAATT GATGAACACT GTTGATGAGT ACATCCCAGA 3 00 

45 ACCAGAACGT GATACTGACA AACCTCTTCT TCTTCCAGTC GAAGACGTAT 3 50 

TCTCAATCAC TGGTCGTGGT ACAGTTGCTT CAGGACGTAT CGACCGTGGT 400 

ACTGTTCGTG TCAACGATGA AATCGAAATC GTTGGTATCA AAGAAGAAAT 4 50 

CCAAAAAGCA GTTGTTACTG GTGTTGAAAT GTTCCGTAAA CAGCTTGACG 500 

AAGGTCTTGC AGGGGACAAC GTAGGTGTAC TTCTTCGTGG TATCCAACGT 550 

50 GATGAAATCG AACGTGGTCA AGTTATCGCT AAACCAGGTT CAATCAACCC 600 

ACACACTAAA TTCAAGGGTG AAGTTTACAT CCTTACTAAA GAAGAAGGTG 650 

GACGTCACAC TCCATTCTTC AACAACTACC GTCCACAGTT CTACTTCCGT 700 

ACAACTGACG TTACAGGTTC AATCGAACTT CCAGCAGGTA CTGAAATGGT 750 

AATGCCTGGT GATAACGTAA CTATCGACGT TGAGTTGATC CACCCAATCG 8 00 

55 CCGTTGAACA AGGTACTCCT T 821 



2 ) INFORMATION FOR SEQ ID NO: 216 

60 

129 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 92 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Streptococcus downei 

(B) STRAIN: ATCC 33748 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216 

15 AGTAGCTTCT ACTGATGGAC CAATGCCACA AACTCGTGAA CACATCTTGC 50 

TTTCACGTCA GGTTGGTGTT AAGAACCTTA TCGTCTTCAT GAACAAGGTT 100 

GACTTGGTTG ACGATGAAGA ATTGCTTGAA TTGGTTGAAA TGGAAATCCG 150 

TGACCTGCTT TCAGAATACG ATTTCCCAGG TGATGATATC CCTGTTGTTC 2 00 

AAGGTTCAGC TCTTAAGGCT CTTGAAGGTG ATACAGCTGC CGAAGACAAG 2 50 

2 0 ATCATGGAAT TGATGGACAT CGTTGATGAC TACATTCCAG AACCAAAACG 300 

TGATACTGAT AAGCCTTTGC TTCTTCCAGT CGAAGATGTA TTCTCAATCA 3 50 

CTGGACGTGG TACTGTAGCT TCAGGACGTA TCGACCGTGG TACTGTTAAG 4 00 

GTCAACGACG AAGTTGAAAT CGTTGGTATC AAGGACGAAA TCCAAAAAGC 450 

AGTTGTTACC GGAGTTGAAA TGTTCCGTAA ACAATTGGAC GAAGGTCTTG 500 

2 5 CAGGGGATAA CGTTGGTGTG CTTCTTCGTG GTATCCAACG TGATGAAATC 550 

GAACGTGGTC AAGTGTTGGC TGCGCCTGGT TCGATTCACC CACACACTAA 600 

GTTTAAAGGT GAAGTTTACA TCCTTTCTAA AGAAGAAGGT GGACGTCATA 650 

CTCCATTCTT TAACAACTAC CGTCCACAGT TCTACTTCCG TACAACTGAC 700 

GTAACTGGTT CAATCGAATT GCCAGCGGGT ACTGAAATGG TTATGCCTGG 7 50 

3 0 TGATAACGTT ACTATCGACG TTGAATTGAT CCACCCAATT GC 792 



2) INFORMATION FOR SEQ ID NO: 217 



35 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 95 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
i5 (A) ORGANISM: Streptococcus dysgalactiae 

(B) STRAIN: ATCC 43078 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217 

50 GTAGTTGCTT CAACAGACGG ACCAATGCCA CAAACTCGTG AGCACATCCT 50 

CCTTTCACGT CAGGTTGGTG TTAAACACCT TATCGTGTTC ATGAACAAAA 10 0 

TTGACCTTGT TGACGATGAA GAATTGCTTG AATTGGTTGA AATGGAAATC 15 0 

CGTGACCTTC TTTCAGAATA CGATTTCCCA GGTGATGACC TTCCAGTTAT 2 00 

CCAAGGTTCA GCTCTTAAAG CTCTTGAAGG CGACACTAAA TTTGAAGACA 2 50 

55 TCATCATGGA ATTGATGGAT ACTGTTGATT CATACATTCC AGAACCAGAA 3 00 

CGTGACACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG TATTCTCAAT 350 

CACAGGTCGT GGTACAGTTG CTTCAGGACG TATCGACCGT GGTACTGTTC 4 00 

GTGTCAACGA CGAAATCGAA ATCGTTGGTA TCAAAGAAGA AACTAAAAAA 4 50 

GCTGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG ACGAAGGTCT 500 

50 TGCAGGAGAC AACGTAGGTA TCCTTCTTCG TGGTGTTCAA CGTGACGAAA 550 

130 
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WO 01/23604 

TCGAACGTGG TCAAGTTATT 

AAATTCAAAG GTGAAGTATA 

CACTCCATTC TTCAACAACT 

ACGTAACAGG TTCAATCGAA 

5 GGTGATAACG TGACAATCAA 



GCTAAACCAG GTTCAATCAA 
TATCCTTTCT AAAGACGAAG 
ATCGTCCACA ATTCTACTTC 
CTTCCAGCTG GTACAGAAAT 
CGTTGAGTTG ATCCACCCAA 



PCT/CA00/01150 

CCCACACACT 600 

GTGGACGTCA 650 

CGTACAACTG 700 

GGTTATGCCT 750 

TCGCC 795 



10 



15 



20 



2 ) INFORMATION FOR SEQ ID NO: 218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus equi subsp, 

(B) STRAIN: ATCC 952 8 



egui 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218 



25 CGGAGCTATC 
GTGAGCACAT 
TTCATGAACA 
TGAAATGGAA 
ACCTTCCAGT 

3 0 AAATACGAAG 
TCCAGAACCA 
ACGTATTCTC 
CGCGGTACTG 
CGAGATCAAA 

35 TTGACGAAGG 
CAACGTGATG 
CAACCCACAC 
AAGGTGGACG 
TTCCGTACTA 

40 AATGGTTATG 
CAATCGCCGT 



CTTGTAGTTG 
CCTTCTTTCA 
AGGTTGACCT 
ATTCGTGACC 
TATCCAAGGT 
ATATCATCAT 
GAACGTGACA 
AATCACTGGA 
TTCGTGTTAA 
AAAGCAGTTG 
TCTTGCAGGG 
AAATCGAACG 
ACTAAATTTA 
TCACACACCA 
CTGACGTAAC 
CCTGGTGATA 
AGAACAAGGT 



CTTCAACTGA 
CGTCAGGTTG 
TGTTGACGAT 
TTCTTTCAGA 
TCAGCGCTTA 
GGAATTGATG 
CAGACAAGCC 
CGTGGTACTG 
CGACGAAATC 
TTACTGGTGT 
GACAACGTTG 
TGGTCAAGTT 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTGACTAT 
ACTACATT 



CGGACCAATG 
GTGTTAAGCA 
GAAGAATTGC 
ATATGATTTC 
AGGCTCTTGA 
GATACTGTTG 
ATTGCTTCTT 
TTGCTTCAGG 
GAAATCGTTG 
CGAAATGTTC 
GTGTTCTTCT 
ATTGCTAAGC 
ATATATCCTT 
ACTATCGTCC 
GAGCTTCCAG 
TGACGTTGAG 



CCACAAACTC 50 

CCTTATCGTG 10 0 

TTGAGCTTGT 150 

CCAGGTGATG 200 

AGGCGACAGC 250 

ATTCATACAT 3 00 

CCAGTCGAGG 350 

ACGTATCGAC 4 00 

GTATCAGAGA 450 

CGTAAACAGC 500 

TCGTGGTGTA 550 

CAGGTTCTAT 600 

ACTAAAGAAG 650 

ACAATTCTAC 700 

CAGGTACAGA 750 

TTGATCCACC 800 

828 



45 2) INFORMATION FOR SEQ ID NO: 219 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus ferus 

(B) STRAIN: ATCC 33477 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219 

131 
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PCT/CA00/01150 



CGGTGCAATC 
GTGAGCACAT 
TTCATGAACA 
TGAAATGGAA 
5 ACCTTCCAGT 
GCTCAAGAAG 
CCCAGAACCA 
ATGTATTCTC 
CGTGGTACTG 

10 CGAAATCACT 
TGGACGAAGG 
CAACGTGATG 
CAACCCACAC 
AAGGTGGACG 

15 TTCCGTACAA 
AATGGTTATG 
CAATCGCCGT 



CTTGTAGTAG 
CCTTCTTTCA 
AAGTTGACTT 
ATCCGTGACC 
TATCCAAGGT 
ATGTTATCAT 
GAACGTGATA 
AATCACAGGT 
TAAGAGTCAA 
AAAGCAGTTG 
TCTTGCTGGT 
AAATCGAACG 
ACTAAATTTA 
TCATACACCA 
CTGACGTAAC 
CCTGGTGATA 
TGAACAAGGT 



CTTCTACAGA 
CGTCAGGTAG 
GGTTGACGAT 
TGCTTTCAGA 
TCAGCTCTTA 
GGAATTGATG 
CTGACAAACC 
CGTGGTACTG 
CGATGAAGTT 
TTACCGGTGT 
GATAACGTTG 
TGGTCAAGTA 
AAGGTGAAGT 
TTCTTCAACA 
TGGTTCAATC 
ACGTGACTAT 
ACTAC 



TGGACCAATG 
GTGTTAAACA 
GAAGAATTGC 
ATATGATTTC 
AAGCGCTTGA 
AAAACCGTTG 
ATTGCTTCTT 
TAGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTGCTTCT 
TTGGCTAAAC 
TTACATCCTT 
ACTACCGTCC 
GAATTGCCAG 
CGACGTTGAA 



CCACAAACTC 50 

CCTTATCGTC 100 

TTGAATTGGT 150 

CCAGGTGATG 200 

AGGTGATACT 250 

ATGAGTACAT 3 00 

CCAGTCGAAG 350 

ACGTATCGAT 4 00 

GTATCAAAGA 4 50 

CGTAAACAAT 500 

CCGTGGTGTG 550 

CAGGTTCAAT 600 

ACTAAAGAAG 650 

ACAGTTCTAC 700 

CAGGTACTGA 750 

TTGATCCACC 800 
825 



20 



25 



30 
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2) INFORMATION FOR SEQ ID NO: 22 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonii 

(B) STRAIN: ATCC 10558 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220 



CGGAGCTATC 
GTGAGCACAT 
TTCATGAACA 

10 TGAAATGGAA 
ATCTTCCAGT 
AAATATGAAG 
CCCAGAACCA 
ACGTATTCTC 

15 CGTGGTATCG 
AGAAATCCAA 
TTGACGAAGG 
CAACGTGATG 
CAACCCACAC 

50 AAGGTGGACG 
TTCCGTACAA 
AATGGTAATG 
CAATCGCCGT 



CTTGTAGTAG 
CCTTCTCTCA 
AAGTTGACTT 
ATCCGTGACC 
TATCCAAGGT 
ATATCATCAT 
GAACGCGACA 
AATCACTGGA 
TTAAAGTCAA 
AAAGCAGTTG 
TCTTGCAGGG 
AAATCGAACG 
ACTAAATTCA 
TCACACTCCA 
CTGACGTTAC 
CCTGGTGATA 
TGAACAAGGT 



CTTCAACTGA 
CGCCAAGTTG 
GGTTGACGAT 
TCTTGTCAGA 
TCAGCTCTTA 
GGAATTGATG 
CTGACAAACC 
CGTGGTACAG 
TGACGAAATC 
TTACTGGTGT 
GACAACGTTG 
TGGACAAGTT 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTAAC TAT 
ACTACT 



TGGTCCTATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGACTTC 
AAGCTCTTGA 
AACACTGTTG 
ATTGCTTCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTGCTTCT 
ATTGCTAAAC 
TTATATCCTT 
ACTACCGTCC 
GAACTTCCAG 
CGACGTTGAG 



CCACAAACTC 50 

CTTGATCGTG 100 

TTGAGTTGGT 150 

CCAGGTGACG 2 00 

AGGTGACTCT 250 

ATGAGTACAT 3 00 

CCAGTCGAAG 3 50 

ACGTATCGAC 4 00 

GTATCAAAGA 4 50 

CGTAAACAGC 500 

TCGTGGTATC 550 

CAGGTTCAAT 6 00 

ACTAAAGAAG 650 

ACAGTTCTAC 700 

CAGGTACTGA 750 

TTGATCCACC 800 
826 



55 



2) INFORMATION FOR SEQ ID NO: 221 



50 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 799 bases 



132 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 2 7335 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221 



TGTAGTAGCT TCAACTGACG GACCAATGCC 
TTCTTTCACG TCAAGTAGGT GTTAAATACC 
GTTGACTTGG TTGACGATGA AGAATTGCTT 
CCGTGATCTT CTTTCAGAAT ACGATTTCCC 
TCCAAGGTTC AGCACTTAAA GCTCTTGAAG 
ATCATCATGG AATTGATGAA TACTGTTGAT 
ACGTGATACT GACAAACCAT TGCTTCTTCC 
TCACTGGACG TGGTACTGTT GCTTCAGGAC 
AAAGTCAACG ATGAAGTTGA AATCGTTGGT 
AGCAGTTGTT ACTGGTGTTG AAATGTTCCG 
TTGCTGGAGA TAACGTAGGG GTTCTTCTTC 
ATTGAACGTG GACAAGTTCT TGCTAAACCA 
TAAATTCAAA GGTGAAGTTT ACATCCTTAC 
ATACTCCATT CTTCAACAAC TACCGTCCTC 
GACGTTACAG GTTCAATCGA ACTTCCTGCA 
TGGTGATAAC GTAACAATTG ATGTTGAGTT 



TCAAACTCGT GAACATATCC 50 

TTATTGTCTT CATGAACAAA 100 

GAATTGGTTG AAATGGAAAT 150 

AGGTGATGAT ATTCCAGTAA 2 00 

GTGATGAAAA ATATGAAGAC 250 

GAATATATTC CAGAACCAGA 3 00 

AGTCGAAGAT GTATTCTCAA 3 50 

GTATCGACCG TGGTACTGTT 4 00 

ATCCGCGAGG AAATCCAAAA 4 50 

TAAACAATTG GACGAAGGTC 5 00 

GTGGTATCCA ACGTGACGAA 55 0 

GGTTCAATTC ATCCACACAC 600 

TAAAGAAGAA GGTGGACGTC 650 

AATTCTACTT CCGTACTACA 700 

GGTACTGAAA TGGTAATGCC 750 

GATCCACCCA ATTGCCGTA 7 99 



2) INFORMATION FOR SEQ ID NO : 222 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus macacae 

(B) STRAIN: ATCC 35911 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222 



TGGTGCTATT CTTGTAGTAG 
GTGAACATAT CCTTCTTTCA 
TTCATGAATA AAGTTGACTT 
TGAAATGGAA ATCCGTGATC 
AACTTCCAGT TATCCAAGGT 
AAGTACGAAG ATATTATCAT 
CCCAGAACCA CAACGTGATA 
ATGTTTTCTC TATTACTGGA 
CGTGGTACTG TTAAGGTTAA 
CGATATTCAA AAAGCAGTTG 
TTGACGAAGG TCTTGCTGGT 
CAACGTGATG AAATTGAACG 
TCATCCACAT ACTAAATTCA 



CTTCAACTGA CGGTCCAATG 
CGCCAAGTAG GTGTTAAAAA 
AGTTGATGAT GAAGAATTGC 
TTCTTACAGA ATATGATTTC 
TCAGCACTTA AAGCTCTTGA 
GGAATTGTTG GATACTGTAG 
CTGACAAGCC ATTGCTTCTT 
CGTGGTACTG TTGCTTCAGG 
TGATGAAGTT GAAATCGTTG 
TTACTGGTGT TGAAATGTTC 
GATAACGTCG GTGTCCTTCT 
CGGTCAAGTT CTTGCTAAAC 
AAGGTGAAGT TTATATTCTT 
133 



CCTCAAACGC 50 

CCTTATTGTT 100 

TTGAATTGGT 150 

CCAGGCGATG 2 00 

AGGTGATACT 250 

ATGATTACAT 3 00 

CCAGTCGAAG 3 50 

ACGTATTGAC 4 00 

GTATTCGTGA 4 50 

CGTAAACAGC 5 00 

TCGTGGTATC 550 

CAGGATCAAT 60 0 

ACTAAAGAAG 650 
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AAGGTGGACG TCATACTCCA TTCTTTAACA 

TTCCGTACAA CTGATGTAAC TGGTTCAATT 

AATGGTTATG CCTGGTGATA ATGTTACGAT 

CAATCGCTGT TGAACAAGGT ACAAC 



PCT/CA00/01150 

ACTACCGTCC AC AGTT C T AC 70 0 

GATTTGCCAG CAGGTACTGA 750 
TGATGTTGAA CTGATCCACC 800 

825 



2) INFORMATION FOR SEQ ID NO : 22 3 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonii (deposited as 
20 Streptococcus mitis) 

(B) STRAIN: ATCC 33399 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223 



25 CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 2 00 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCC CTTGAAGGTG ACACTAAATA 250 

3 0 CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TACATCCCAG 3 00 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAAGACGTA 3 50 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 4 00 

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 4 50 

CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

35 GAAGGTCTTG CCGGAGATAA TGTAGGTGTC CTTCTTCGTG GTGTTCAACG 550 

TGATGAAATC GAACGTGGAC AAGTTATTGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAAGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAAT TCTACTTCCG 700 

TACTACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

4 0 TAATGCCTGG TGATAACGTG ACAATCGACG TTGAGTTGAT CCACCCAATC 8 00 

GCCGTAGAAC AAGGTACTAC AT 822 



45 2) INFORMATION FOR SEQ ID NO: 224 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 827 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus /nutans 

(B) STRAIN: ATCC 25175 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224 

so 
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CCCTGGTGCT ATCCTTGTAG TAGCTTCAAC TGATGGACCA ATGC C AC AAA 50 

CTCGTGAACA CATTCTTCTT TCACGTCAAG TTGGTGTTAA ATACCTCATT 100 

GTCTTCATGA ATAAAGTTGA TTTGGTTGAC GATGAAGAAT TGCTTGAATT 150 

GGTTGAAATG GAAATCCGTG ATCTTCTTTC AGAATATGAT TTCCCAGGTG 2 00 

5 ATGATATTCC AGTTATTCAA GGTTCAGCTC TTAAAGCTCT TGAAGGCGAT 2 50 

ACTGCTCAAG AAGATATCAT CATGGAATTA ATGCATACTG TTGATGACTA 3 00 

CATTCCAGAT CCAGAACGTG ATACTGACAA GCCGCTCCTT CTTCCAGTCG 3 50 

AAGATGTTTT CTCAATCACT GGTCGTGGTA CTGTTGCTTC AGGACGTATT 4 00 

GATCGTGGTA CTGTTAAAGT TAACGATGAA GTTGAAATCG TTGGTATCCG 4 50 

10 TGATGACATT CAAAAAGCTG TTGTTACTGG TGTTGAAATG TTCCGTAAAC 500 

AATTGGATGA AGGTATTGCA GGGGATAATG TTGGTGTTCT CCTTCGTGGT 550 

ATCCAACGTG ATGAAATCGA ACGTGGTCAA GTTCTTGCTA AACCAGGTTC 600 

AATTCACCCA CATACTAAAT TCAAAGGTGA AGTTTATATC CTTACTAAAG 650 

AGGAAGGTGG ACGTCATACA CCATTCTTCA ATAAC TATCG TCCACAATTC 700 

15 TACTTCCGTA CAACTGACGT AACTGGTTCA ATTGAGTTGC CAGCAGGTAC 750 

TGAAATGGTT ATGCCTGGTG ATAACGTTAC TATTGACGTT GAATTGATCC 80 0 

ATCCAATCGC TGTTGAACAA GGTACTA 82 7 



20 



30 



55 



2) INFORMATION FOR SEQ ID NO: 225 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 824 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus parasanguinis 

(B) STRAIN: ATCC 15912 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225 

AGCTATCCTT GTAGTAGCTT CAACTGACGG ACCAATGCCA CAAACACGTG 50 

AACACATCCT TCTTTCACGT CAGGTTGGTG TTAAACACTT GATCGTCTTC 100 

ATGAACAAAG TTGACTTGGT TGATGATGAA GAATTGCTTG AATTGGTTGA 150 

4 0 AATGGAAATC CGTGACCTTC TTTCAGAATA CGATTTCCCA GGTGATGACC 200 

TTCCAGTTAT CCAAGGTTCA GCTCTTAAAG CTCTTGAAGG TGACTCTAAA 250 

TATGAAGATA TCATCATGGA ATTGATGGAT ACTGTTGATG AGTACATCCC 3 00 

AGAACCAGAA CGCGATACTG ACAAACCATT GCTTCTTCCA GTCGAAGACG 3 50 

TATTCTCAAT CACTGGACGT GGTACAGTTG CTTCAGGACG TATCGACCGT 400 

4 5 GGTGTTGTTC GTGTCAATGA TGAAATCGAA ATCGTTGGTA TCAAAGAAGA 450 

AATCCAAAAA GCAGTTGTTA CTGGTGTTGA AATGTTCCGT AAACAACTTG 500 

ACGAAGGTCT TGCAGGGGAT AACGTTGGTG TGCTTCTTCG TGGTATCCAA 550 

CGTGATGAAA TCGAACGTGG ACAAGTTATC GCTAAACCAG GTTCAATCAA 600 

CCCACACACT AAATTCAAAG GTGAAGTTTA CATCCTTACT AAAGAAGAAG 650 

50 GTGGACGTCA TACTCCATTC TTCAACAACT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACTGG ATCTATCGAA CTTCCACCAG GAACTGAAAT 750 

GGTAATGCCT GGTGATAACG TGACTATCGA CGTTGAGTTG ATCCACCCAA 800 

TCGCCGTTGA ACAAGGTACT ACAT 824 



2) INFORMATION FOR SEQ ID NO: 226 

( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 824 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus ratti 

(B) STRAIN: ATCC 19645 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226 

TGGTGCTATC CTTGTAGTAG CTTCAACTGA TGGACCAATG CCGCAAACTC 50 

GTGAACACAT CTTGCTTTCA CGTCAAGTTG GTGTTAAATA CCTTATCGTC 100 

15 TTCATGAACA AGGTTGACTT GGTTGATGAT GAAGAATTGC TTGAATTGGT 150 

TGAAATGGAA ATCCGTGATC TTCTTTCAGA ATACGATTTC CCAGGTGATG 200 

ACATTCCAGT TATCCAAGGT TCAGCCCTTA AAGCTCTTGA AGGTGACACT 250 

GAACAAGAAG ATGTTATCAT GGAATTGATG AAAACAGTTG ATGAGTACAT 300 

CCCAGATCCA GAACGCGATA CTGATAAGCC ATTGCTTCTT CCAGTCGAAG 350 

2 0 ACGTGTTCTC AATCACTGGA CGTGGTACTG TTGCATCAGG ACGTATCGAC 40 0 

CGTGGTACTG TTAAAGTCAA TGACGAAGTT GAAATCGTTG GTATCCGTGA 450 

TGACATCCAA AAAGCTGTTG TTACTGGTGT TGAAATGTTC CGTAAACAGC 50 0 

TTGACGAAGG TCTTGCTGGT GATAACGTTG GTGTACTTCT TCGTGGTATC 55 0 

CAACGTGATG AAATCGAACG CGGTCAAGTT CTTGCTAAAC CAGGTTCAAT 600 

25 TCATCCGCAT ACTAAATTTA AAGGTGAAGT TTACATCCTT ACTAAAGAAG 650 

AAGGCGGACG TCACACTCCA TTCTTCAACA ACTACCGTCC ACAGTTCTAC 70 0 

TTCCGTACAA CTGACGTAAC TGGTTCAATC GAATTGCCAG CAGGTACTGA 750 

AATGGTTATG CCTGGTGATA ACGTGACTAT CGACGTTGAA TTGATCCACC 80 0 

CAATCGCTGT TGAACAAGGT ACTA 824 

30 



2) INFORMATION FOR SEQ ID NO: 227 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus sanguinis 
15 (B) STRAIN: ATCC 10556 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227 

TGTAGTAGCT TCAACTGACG GACCAATGCC ACAAACTCGT GAGCACATCT 50 

50 TGCTTTCACG TCAGGTTGGT GTTAAACACT TGATCGTCTT CATGAACAAA 100 

GTTGACTTGG TTGACGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGACCTC TTGTCAGAAT ACGACTTCCC AGGTGACGAT CTTCCAGTTA 200 

TCCAAGGTTC AGCTCTTAAA GCTCTTGAAG GTGACTCTAA ATATGAAGAC 2 50 

ATCATCATGG AATTGATGGA CACTGTTGAT GAGTACATCC CAGAACCAGA 3 00 

55 ACGCGATACT GACAAGCCAT TGCTTCTTCC AGTCGAAGAC GTATTCTCAA 3 50 

TCACTGGTCG TGGTACAGTT GCTTCAGGAC GTATCGACCG TGGTATCGTT 4 00 

AAAGTCAACG ACGAAATCGA AATCGTTGGT ATCAAAGAAG AAATCCAAAA 450 

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAGCTT GACGAAGGTC 500 

TTGCAGGGGA CAACGTAGGT GTGCTTCTCC GTGGTATCCA ACGTGATGAA 550 

50 ATCGAACGTG GACAAGTTAT CGCTAAACCA GGTTCAATCA ACCCACACAC 600 
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TAAATTCAAG GGTGAAGTTT ATATCCTTAC 
ACACTCCATT CTTCAACAAC TACCGTCCAC 
GACGTTACAG GTTCAATCGA ACTTCCAGCA 
TGGTGATAAC GTAACAATCG ACGTTGAGTT 



TAAAGAAGAA GGCGGACGTC 
AGTTCTACTT CCGTACAACT 
GGTACTGAAA TGGTAATGCC 
GATCCACCCA ATCGC 



650 
700 
750 
795 
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2) INFORMATION FOR SEQ ID NO: 22 8 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus sobrinus 

(B) STRAIN: ATCC 33478 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228 

TGTAGTAGCT TCTACTGACG GACCAATGCC ACAAACTCGT GAACACATCT 5 0 

TGCTTTCACG CCAAGTTGGT GTTAAGAACC TCATCGTCTT CATGAACAAG 100 

GTTGACTTGG TTGATGATGA AGAATTGCTT GAATTGGTTG AAATGGAAAT 150 

CCGTGATCTT CTTTCAGAAT ACGATTTCCC AGGTGACGAC ATTCCTGTTG 2 00 

TTCAAGGTTC AGCTCTTAAG GCTCTTGAAG GTGATACAGC TGCCGAAGAC 2 50 

AAGATTATGG AATTGATGGA CATCGTTGAT GATTACATTC CAGAAC CAAA 300 

ACGCGATACT GATAAGCCAT TGCTTCTCCC AGTCGAAGAC GTATTCTCAA 350 

TCACTGGTCG TGGTACTGTT GCTTCAGGAC GTATTGACCG TGGTACTGTT 400 

AAGGTTAACG ACGAAGTTGA AATCGTTGGT ATCCGTGACG ATATC CAAAA 450 

AGCAGTTGTT ACTGGAGTTG AAATGTTCCG TAAGCAATTG GACGAAGGTC 50 0 

TTGCTGGAGA TAACGTTGGT GTGCTTCTTC GTGGTATCCA ACGTGATGAA 55 0 

ATTGAACGTG GTCAAGTATT GGCTGCACCT GGTTCAATCC ACCCACACAC 60 0 

TAAGTTCAAG GGTGAAGTTT ACATCCTTTC TAAAGATGAA GGTGGACGTC 65 0 

ACACTCCATT CTTCAACAAC TACCGTCCAC AGTTCTACTT CCGTACAACT 70 0 

GACGTAACTG GTTCAATCGA ATTGCCAGCA GGTACTGAAA TGGTTATGCC 75 0 

TGGTGATAAC GTTACTATCG ACGTTGAATT GATCCACCCA ATCGC 795 



2) INFORMATION FOR SEQ ID NO: 229 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus suis 

(B) STRAIN: ATCC 43765 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229 

TGTAGTAGCT TCAACTGACG GTCCAATGCC ACAAACTCGT GAGCACATCC 50 
TTCTTTCACG TCAGGTTGGT GTTAAACACC TTATCGTCTT CATGAACAAA 100 
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GTTGACTTGG TTGACGATGA AGAATTGCTT GAGTTGGTTG AAATGGAAAT 150 

CCGTGACCTT CTTTCAGAAT ACGATTTCCC AGGTGATGAT CTTCCAGTTA 2 00 

TCCAAGGTTC AGCTCTTAAA GCTCTTGAAG GTGACTCTAA GTACGAAGAC 250 

ATCGTTATGG AATTGATGAA CACTGTTGAT GAGTACATTC CAGAACCAGA 3 00 

5 ACGCGACACT GACAAACCAT TGTTGCTTCC AGTCGAGGAC GTATTCTCAA 3 50 

TCACTGGTCG TGGTAC TGTA GCTTCAGGAC GTATCGACCG TGGTAC TGTT 4 00 

CGTGTCAACG ACGAAATCGA AATCGTTGGT CTTCAAGAAG AAAAATCTAA 4 50 

AGCAGTTGTT ACTGGTGTTG AAATGTTCCG TAAACAACTT GACGAAGGTC 500 

TTGCCGGCGA TAACGTTGGT GTGCTTCTTC GTGGTGTACA ACGTGATGAA 550 

10 ATCGAACGTG GTCAAGTTAT CTCTAAACCA GGTTCTATCA ACCCACACAC 600 

TAAATTCAAA GGTGAAGTTT ACATCCTTAC TAAAGAAGAA GGTGGACGTC 650 

ACACTCCATT CTTCGACAAC TACCGTCCAC AGTTCTACTT CCGTACAACT 700 

GACGTAACTG GTTCAATCAA ATTGCCAGAA GGTACTGAAA TGGTAATGCC 750 

TGGTGATAAC GTTACTATCG ACGTTGAATT GATCCACCCA ATCGCCG 797 

15 



2) INFORMATION FOR SEQ ID NO: 230 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: Genomic DMA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus uteris 
30 (B) STRAIN: ATCC 19436 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230 

TTGTTGTTGC ATCAACTGAT GGAC CAATGC CACAAACTCG TGAGCACATC 50 

CTTCTTTCAC GCCAAGTTGG TGTTAAACAC CTTATCGTTT TCATGAACAA 100 

AATCGACCTT GTTGACGATG AAGAATTGCT TGAATTAGTT GAAATGGAAA 150 

TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA CCTACCAGTT 2 00 

ATCCAAGGTT CAGCT CTTAA AGCTCTTGAA GGTGATTCTA AATACGAAGA 2 50 

CATCATCATG GAATTGATGA AAACTGTTGA TGAGTATATT CCAGAACCAG 3 00 

AACGTGATAC AGACAAACCA TTACTTCTTC CAGTCGAAGA CGTATTCTCA 350 

ATCACAGGTC GTGGTACTGT AGCTTCAGGA CGTATCGATC GTGGTACTGT 4 00 

TCGTGTCAAC GACGAAATTG AAATCGTTGG TATCAAAGAA GAAACTAAAA 4 50 

AAGCAGTTGT TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

CTTGCAGGAG ATAACGTAGG TATCCTTCTT CGTGGTGTTC AACGTGACGA 550 

AATCGAACGT GGACAAGTTA TTGCTAAACC AGGTTCAATC AACCCACACA 600 

CTAAATTCAA AGGTGAAGTT TACATCCTTT CTAAAGATGA AGGTGGACGT 650 

CATACTCCAT TCTTCAACAA CTACCGTCCT CAATTCTATT TCCGTACAAC 700 

TGACGTAACA GGTTCAATCG AACTTCCAGC TGGTAC TGAA ATGGTAATGC 750 

CTGGTGATAA CGTGACAATC AGCGTTGAGT TGATCCACCC AAT 7 93 



35 



40 



45 



50 



2 ) INFORMATION FOR SEQ ID NO : 231 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus vestibularis 
5 (B) STRAIN: ATCC 49124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231 

TTGTAGTAGC ATCTACTGAC GGACCAATGC CACAAACTCG TGAGCACATC 50 

10 CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 100 

AGTTGACTTG GTTGACGATG AAGAATTGCT TGAATTGGTT GAAATGGAAA 150 

TCCGTGACCT TCTTTCAGAA TACGATTTCC CAGGTGATGA TATTCCAGTT 2 00 

ATCCAAGGTT CAGCTCTTAA AGCTCTTGAA GGTGATTCTA AATACGAAGA 250 

CATCATCATG GACTTGATGA ACACTGTTGA CGAATACATT CCAGAACCAG 3 00 

15 AACGTGACAC TGACAAACCA TTGTTGCTTC CAGTCGAAGA CGTATTCTCA 3 50 

ATCACTGGTC GTGGTACTGT TGCTTCAGGA CGTATCGACC GTGGTGTTGT 4 00 

TCGTGTTAAT GACGAAGTTG AAATCGTTGG TCTTAAAGAA GAAATCCAAA 4 50 

AAGCAGTTGT TACTGGTGTA GAAATGTTCC GTAAACAACT TGACGAAGGT 500 

ATTGCCGGAG ATAACGTCGG TGTCCTTCTT CGTGGTATCC AACGTGATGA 55 0 

2 0 AATTGAACGT GGTCAAGTAT TGGCTGCACC TGGTTCAATC AACCCACACA 60 0 

CTAAATTCAA AGGTGAAGTT TACATCCTTT CTAAAGAAGA AGGTGGACGT 6 50 

CACACTCCAT TCTTCAACAA CTACCGTCCA CAGTTCTACT TCCGTACAAC 7 00 

TGACGTAACA GGTTCAATCG AACTTCCTGC AGGTACTGAA ATGGTTATGC 750 

CTGGTGATAA CGTGACTATC GACGTTGAGT TGATCCACCC AATCGCCG 798 

25 



2) INFORMATION FOR SEQ ID NO: 232 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tatumella ptyseos 
40 (B) STRAIN: ATCC 33301 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232 

GGCGCTATCC TGGTTGTTGC TGCAACTGAC GGCCCTATGC CTCAGACCCG 50 

4 5 TGAGCACATC CTGCTGGGCC GCCAGGTAGG CGTTCCTTAC ATCATCGTGT 100 

TCCTGAACAA ATGTGACATG GTTGATGATG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TCCGTGACCT GCTGTCACAG TACGACTTCC CGGGTGACGA 2 00 

CACGCCAATC GTTCGCGGTT CAGCGCTGAA AGCACTGGAA GGTGAAGGCG 2 50 

AGTGGGAAGA GAAGATTCTG GAGCTGGCTG GCTTCCTGGA TTCTTACATC 3 00 

50 CCTGAGCCAG AGCGTGCTAT CGATCAGCCG TTCCTGCTGC CAATCGAAGA 3 50 

CGTATTCTCA ATCTCCGGTC GTGGTACAGT TGTTACCGGT CGTGTAGAGC 4 00 

GCGGGATCAT CAAAGTCGGT GAAGAAGTTG AGATCGTTGG TATCAAAGAT 450 

ACTGCGAAAT CAACCTGTAC CGGTGTTGAA ATGTTCCGTA AACTGCTGGA 500 

CCAGGGTCAG GCGGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAGC 550 

05 GTGAAGAGAT CGAACGTGGT CAGGTTCTGG CTAAACCAGG TTCAATCAAA 600 
CCACACACCC AGTTCGAGTC AGAAGTTTAT ATTCTGTCTA AAGACGAAGG 650 
CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 700 
GTACAACTGA CGTGACCGGA ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 
GTAATGCCTG GTGACAACAT CAAAATGGTT GTTACCCTGA TCCATCCAAT 800 

50 CGCGATGGAC GATGGTCTGC GTTTCGCAA 82 9 
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2) INFORMATION FOR SEQ ID NO: 233 

5 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Trabulsiella. guamensis 

(B) STRAIN: ATCC 4 9490 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233 

2 0 GGCGCAATCC TGGTAGTAGC AGCGACTGAC GGCCCGATGC CGCAGACTCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 150 

GAGATGGAAG TTCGTGAACT GCTGTCTCAG TACGATTTCC CGGGCGATGA 2 00 

CACGCCGATC GTACGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

25 AGTGGGAAGC GAAAATCATC GAACTGGCAG GTTTCCTGGA TTCTTACATT 3 00 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 3 50 

CGTATTCTCC ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 4 00 

GCGGTATCAT CAAAGTGGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAG 450 

ACTGCGAAGT CAACCTGTAC TGGCGTAGAA ATGTTCCGCA AACTGCTGGA 500 

30 CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAC 600 

CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGACGAAGG 650 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

35 GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCGATGGAC GACGGTCTGC GTTTCGCAA 829 



40 2) INFORMATION FOR SEQ ID NO: 234 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Veillonella. parvula 

(B) STRAIN: ATCC 10790 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234 



CGGCGCTATC TTGGTTGTAT CCGCAGCTGA CGGCCCTATG CCTCAAACTC 50 

GCGAACACAT CTTGTTGGCT CGCCAAGTTG GTGTTCCTGC AATCGTAGTA 100 

TTCTTGAACA AAGCTGACAT GGTTGACGAT GAAGAATTGA TCGAATTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTTTCTTC CTACGAATTC CCTGGCGACG 2 00 

60 AAGTACCTAT CGTTGTAGGT TCCGCGTTGA AAGCTTTGGA AGGCGATGCT 250 
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CAATATGTAG 
CCCAACACCA 
ATGTTTTCAC 
CGTGGTCAAG 
AAAAGCTGAA 
TGGATTCTGC 
GATCGTAAAG 
CAACCCACAY 
AAGGTGGTCG 
TTCCGTACAA 
AATGTGTATG 
CAATCGCTAT 



CTAAAATTGA 
GTTCGTGACA 
AATCACTGGT 
TAAACGTTGG 
CAATACGTAG 
AGTAGCAGGT 
ACATYGAACG 
ACAAAATTCA 
TCATACTCCA 
CAGACGTAAC 
CCTGGCGATA 
CGAAGAAGGT 



CGAATTGATG 
CTGATAAACC 
CGTGGTACAG 
TGATACTGTT 
TAACAGGTCT 
GACAACGTAG 
TGGTCAAGTA 
AAGCAGAAGT 
TTCTTCTCCA 
AGGTGTTGTA 
ACGTAACAAT 
CTTCG 



GACGCTGTAG 
ATTCTTGATG 
TAGCAACTGG 
GAAGTAGTAG 
TGAAATGTTC 
GTGCATTGCT 
TTGGCTAAAC 
ATACGTATTG 
ACTACCGTCC 
AACCTTCCTG 
GGAAATCGAA 



PCT/CA00/01150 

ACTCCTACAT 3 00 

CCTGTGGAAG 350 

CCGTGTTGAA 4 00 

GCTTGAAAGA 4 50 

CGTAAAGTGT 500 

TCGTGGTGTT 550 

CAGGTTCCAT 600 

ACTAAAGAAG 650 

ACAATTCTAC 700 

AAGGTGTAGA 750 

TTGATTACTC 8 00 

825 
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2 ) INFORMATION FOR SEQ ID NO: 235 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 825 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia, enterocolitica 

(B) STRAIN: ATCC 9 610 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235 

CGCTATCCTG GTTGTTGCTG CAACTGATGG CCCAATGCCA CAGACTCGTG 50 

AGCACATCCT GTTGGGTCGT CAGGTTGGTG TTCCTTACAT CATCGTATTC 100 

ATGAACAAAT GTGACATGGT TGACGATGAA GAGCTGCTAG AACTGGTAGA 150 

35 AATGGAAGTG CGCGATCTTC TGTCTACCTA CGATTTCCCA GGCGATGATA 2 00 

CGCCAGTTGT TCGTGGTTCC GCGCTGAAAG CATTGGAAGG CGAACCTGAG 250 

TGGGAAGCAA AAATTATCGA ACTGGCTGGC TACCTGGATT CTTACATCCC 3 00 

AGAACCAGAG CGTGCTATCG ATAAGCCGTT CCTGCTGCCA ATCGAAGACG 3 50 

TATTCTCTAT CTCTGGTCGT GGTACTGTTG TAACGGGTCG TGTAGAGCGC 4 00 

4 0 GGTATCGTTA AAGTTGGTGA AGAAGTCGAA ATTGTTGGCC TGAAAGATAC 4 50 

CGTTAAATCT ACTTGTACTG GCGTTGAAAT GTTCCGCAAA CTGCTGGATG 50 0 

AAGGCCGTGC AGGTGAGAAC GTTGGTGTTC TGCTGCGTGG TATCAAGCGT 550 

GAAGATATCG AACGTGGTCA AGTTCTTGCT AAACCAGGTT CGATTAAACC 6 00 

ACACACCAAA TTTGAATCAG AAGTTTATAT TCTGAGCAAA GATGAAGGTG 650 

4 5 GTCGCCATAC TCCGTTCTTC AAAGGCTACC GTCCTCAGTT CTACTTCCGT 700 

ACAACTGATG TAACCGGTAC TATTGAACTG CCAGAAGGCG TTGAGATGGT 750 

GATGCCAGGT GATAACATTC AAATGATTGT TAACCTGATT GCTCCTATCG 800 

CAATGGATGA CGGCTTGCGC TTTGC 825 



2) INFORMATION FOR SEQ ID NO: 236 



( i ) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DN A 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia frederiksenii 

(B) STRAIN: ATCC 33641 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236 

GGCGCGATCC TGGTTGTTGC TGCCACTGAT GGCCCGATGC CACAGACTCG 50 

CGAGCACATT CTGTTAGGGC GTCAGGTGGG TGTTCCTTAC ATCCTGGTCT 100 

10 TCCTGAACAA ATGTGACATG GTTGACGACG AAGAGCTGCT GGAACTGGTA 150 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CTGGCGACGA 2 00 

CACTCCAGTT ATCCGTGGTT CTGCGCTGAA AGCGCTGGAA GGCGAAGCTG 2 50 

AGTGGGAAGC AAAAATCATC GAATTGGCTG AGGCGCTGGA TAGCTATATT 300 

CCACAGCCAG AGCGTGCGAT TGATAAACCA TTCCTGCTGC CAATCGAAGA 350 

15 CGTATTCTCA ATCTCTGGCC GTGGTACTGT TGTCACCGGT CGTGTAGAGC 4 00 

GCGGTATCGT TAAAGTCGGC GAAGAAGTCG AAATCGTTGG TATCATTGAT 4 50 

ACCATCAAGA CTACCTGTAC TGGTGTTGAA ATGTTCCGCA AATTGCTGGA 50 0 

CGAAGGCCGT GCGGGTGAGA ACGTTGGTGT TCTGCTACGT GGTACTAAAC 550 

GTGATGACGT ACAACGTGGT CAGGTATTGG CAAAACCAGG TTCTATCAAG 600 

2 0 CCACACACCA AATTTGAATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650 

TGGTCGCCAT ACTCCGTTCT TCAAAGGTTA TCGTCCTCAG TTCTACTTCC 700 

GTACAACTGA CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTTGAGATG 750 

GTGATGCCAG GTGATAACAT TCAAATGATT GTTAACCTGA TTGCTCCTAT 8 00 

CGCAATGGAT GACGGTCTGC GCTTTGCG 828 

25 



2) INFORMATION FOR SEQ ID NO: 237 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia intermedia 
*0 (B) STRAIN: ATCC 29909 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237 

CTTGGTTGTC GCTGCAACTG ATGGTCCTAT GCCACAGACT CGCGAGCACA 50 

45 TCCTGCTAGG TCGTCAGGTG GGTGTTCCTT ACATCCTGGT CTTCCTGAAC 100 

AAGTGTGACA TGGTTGACGA TGAGGAGTTG CTGGAATTGG TAGAAATGGA 150 

AGTCCGCGAA CTTCTGTCTC AATATGATTT CCCTGGCGAT GATACTCCTG 2 00 

TTATCCGTGG TTCAGCGCTG AAGGCGTTGG AAGGCGAGCC TGAATGGGAA 2 50 

GCAAAAATTA TCGAATTAGC TGAGGCGCTG GATAGTTATA TTCCACAGCC 3 00 

50 AGAGCGCGCG ATTGATAGAC CATTCTTGCT GCCAATCGAA GACGTATTCT 3 50 

CTATCTCAGG TCGTGGTACA GTCGTCACTG GTCGTGTAGA GCGTGGGATC 4 00 

GTTAAAGTTG GCGAAGAAGT TGAAATCGTT GGTATTATCG ATTCCATTAG 450 

AACAACATGT ACTGGCGTTG AAATGTTCCG CAAATTGCTG GACGAAGGCC 500 

GCGCGGGTGA GAACGTTGGT GTTCTACTGC GTGGGACTAA ACGTGATGAC 550- 

55 GTACAGCGTG GTCAGGTATT AGCTAAGCCA GGTTCTATCA AGCCACATAC 600 

TAAATTCGAA TCCGAAGTTT ATATTCTGAG CAAAGATGAA GGCGGGCGTC 650 

ACACGCCGTT CTTCAAAGGC TACCGTCCTC AGTTCTACTT CCGTACAACG 700 

GATGTAACCG GTACTATTGA ATTGCCAGAC GGCGTTGAGA TGGTGATGCC 750 

AGGTGATAAC ATTCAAATGA TTGTTAACCT GATTGCACCT ATTGCGATGG 800 

50 ATGATGGTCT GCG 813 
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2) INFORMATION FOR SEQ ID NO: 23 8 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Yersinia pestis 

(B) STRAIN: KIM D27 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238 

2 0 GGAGCGATCT TGGTTGTTGC TGCAACCGAT GGCCCTATGC CGCAGACTCG 50 

TGAGCATATC CTGCTGGGCC GCCAGGTTGG TGTCCCATAC ATTATTGTCT 100 

TCCTGAACAA ATGTGACATG GTTGACGATG AAGAGTTGCT AGAGTTGGTT 150 

GAAATGGAGG TTCGTGAGCT TCTGTCTCAA TACGATTTCC CAGGCGACGA 200 

CACTCCAGTC ATCCGTGGTT CAGCGTTGAA AGCCCTGGAA GGTGACGCTG 250 

25 AGTGGGAAGC TAAAATTATC GAGTTGGCAG AAGCTCTGGA TAGCTATATT 3 00 

CCGCAACCAG AACGCGCTAT TGATAGACCA TTCCTATTGC CAATTGAAGA 350 

CGTATTCTCT ATTTCTGGTC GTGGTACTGT AGTTACTGGT CGTGTAGAAC 4 00 

GTGGTATTGT TAAGGTCGGC GAAGAAGTTG AAATCGTTGG TATTATCGAT 450 

ACGATTAAAA CAACTTGTAC TGGCGTTGAA ATGTTCCGCA AGCTGCTGGA 500 

3 0 TGAAGGCCGT GCTGGTGAAA ATGTTGGTGT TCTGCTGCGT GGTACTAAGC 550 

GTGACGATGT TCAGCGTGGT CAAGTACTGG CGAAACCAGG TTCTATCAAG 600 

CCACACACGA AGTTTGAGTC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 650 

CGGCCGTCAT ACACCGTTCT TCAAGGGCTA CCGTCCTCAG TTCTACTTCC 7 00 

GTACAACTGA CGTGACCGGT ACCATTGAGC TGCCAGAAGG CGTTGAAATG 750 

35 GTCATGCCTG GTGACAACGT AAACATGGTT GTTAACCTAA TTGCTCCTAT 800 

CGCAATGGAT GATGGTCTGC GCTTCGCAA 829 



4 0 2) INFORMATION FOR SEQ ID NO : 239 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia, pseudotuberculosis 

(B) STRAIN: ATCC 29833 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239 

TGGAGCGATC TTGGTTGTTG CTGCAACCGA TGGCCCTATG CCGCAGACTC 50 

GTGAGCATAT CCTGCTGGGC CGCCAGGTTG GTGTCCCATA CATTATTGTC 100 

TTCCTGAACA AATGTGACAT GGTTGACGAT GAAGAGTTGC TAGAGTTGGT 150 

TGAAATGGAG GTTCGTGAGC TTCTGTCTCA ATACGATTTC CCAGGCGACG 200 

ACACTCCAGT CATCCGTGGT TCAGCGTTGA AAGCCCTGGA AGGTGACGCT 250 
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5 TACGATTAAA ACAACTTGTA 
ATGAAGGCCG TGCTGGTGAA 
CGTGACGATG TTCAGCGTGG 
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CGAGTTGGCA GAAGCTCTGG 
TTGATAGACC ATTCCTATTG 
CGTGGTACTG TAGTTACTGG 
CGAAGAAGTT GAAATCGTTG 
CTGGCGTTGA AATGTTCCGC 
AATGTTGGTG TTCTGCTGCG 
TCAAGTACTG GCGAAACCAG 
CAGAAGTTTA TATTCTGAGC 
TTCAAGGGCT ACCGTCCTCA 
TACCATTGAG CTGCCAGAAG 
TAAACATGGT TGTTAACCTA 



PCT/CA00/01150 

ATAGCTATAT 300 

CCAATTGAAG 350 

TCGTGTAGAA 4 00 

GTATTATCGA 4 50 

AAGCTGCTGG 500 

TGGTACTAAG 550 

GTTCTATCAA 600 

AAAGATGAAG 650 

GTTCTACTTC 700 

GCGTTGAAAT 750 

ATTGCTCCTA 8 00 

817 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 240 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 9 bases 
2 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
•(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia rohdei 

(B) STRAIN: ATCC 43380 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 0 

TGGCGCGATC CTGGTTGTTG CTGCAACTGA TGGCCCAATG CCACAGACTC 50 

GCGAGCACAT CCTGTTGGGT CGTCAAGTGG GTGTTCCTTA CATCTTAGTC 100 

TTCCTGAACA AGTGTGACAT GGTTGACGAC GAAGAGTTGC TGGAACTGGT 150 

3 5 TGAAATGGAA GTTCGTGAGC TTCTGTCTCA ATACGATTTC CCTGGCGATG 2 00 

ACACTCCGGT TATTCGTGGT TCCGCGCTGA AAGCGCTGGA AGGCGAGGCC 250 

GAGTGGGAAG CCAAAATTAT TGAACTTGCT GAAGCACTGG ATAGCTACAT 300 

TCCACAGCCA GAGCGCGCGA TTGATAAACC ATTCTTGCTG CCAATCGAAG 350 

ACGTATTCTC TATCTCAGGC CGTGGAACAG TTGTTACCGG GCGTGTTGAG 4 00 

4 0 CGCGGTATCG TCAGAGTGGG CGAAGAAGTT GAAATCGTGG GTATCATCGA 450 

CACCATTAAA ACCACTTGTA CCGGTGTTGA AATGTTCCGC AAACTGCTGG 500 

ACGAAGGCCG TGCCGGTGAG AACGTTGGTG TTCTGCTGCG CGGTACTAAA 550 

CGCGATGACG TGCAACGTGG TCAAGTGTTG GCTAAACCAG GTTCTATTAA 600 

GCCGCATACC AAATTTGAGT CAGAAGTTTA TATTCTGAGC AAAGATGAAG 650 

45 GTGGTCGTCA TACTCCGTTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTGACCGG TACCATCGAA CTGCCAGACG GTGTTGAGAT 750 

GGTGATGCCA GGTGATAACA TTCAAATGAT TGTTAACCTG ATTGCGCCTA 8 00 

TTGCAATGGA TGACGGTCTA CGATTTGCA 82 9 



2) INFORMATION FOR SEQ ID NO: 241 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

144 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 

(B) STRAIN: ATCC 35313 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241 

TGGCGCGATC CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACTC 50 

GTGAGCACAT CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG 100 

10 TTCCTGAACA AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT 150 

AGAAATGGAA GTTCGTGAAC TTCTGTCTCA GTACGATTTC CCGGGCGACG 200 

ACACTCCGAT CATCCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGAAGCA 2 50 

GAGTGGGAAG CTAAAATCGT TGAGCTGGCT GGCTTCCTGG ATTCTTACAT 300 

CCCAGAACCA GTTCGTGCTA TCGACCTGCC GTTCCTGCTG CCGATCGAAG 3 50 

15 ACGTATTCTC CATCTCCGGT CGTGGCACCG TTGTTACCGG TCGTGTAGAG 4 00 

CGCGGTATCG TTAAAGTTGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA 4 50 

GACTGCTAAG TCTACCTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 50 0 

ACGAAGGCCG TGCTGGTGAG AACGTTGGTG TTCTGCTGCG TGGTATCAAA 550 

CGTGAAGAAA TCGAACGTGG TCAGGTACTG GCTAAGCCGG GTTCCATCAA 600 

20 GCCGCACACC AAATTCGAAT CTGAAGTTTA TATCCTGTCC AAAGACGAAG 650 

GCGGCCGTCA TACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC 7 00 

CGTACAACTG ACGTGACTGG TACCATCGAA CTGCCGGAAG GCGTAGAGAT 750 

GGTAATGCCG GGCGACAACA TCAAAATGGT TGTTAC CCTG ATCCACCCGA 8 00 

TCGC 804 

25 



2) INFORMATION FOR SEQ ID NO: 242 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Achromobacter xylosoxidans subsp. 
4 0 deni tri fi cans 

(B) STRAIN: ATCC 15173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242 

4 5 TCAGTTCCCC CGCGATCACA TGCCCAAGAT CTACGAAGCG CTTACTCTGG 50 

CCGACGAGGG TTCCTCGTTC GCCGAAAAGG GTCTGACGCT GGAAGTGCAG 10 0 

CAACAGCTGG GCGACGGCGT GGTGCGTACC ATCGCGCTGG GCTCCAGCGA 150 

CGGCCTGCGC CGCGGTATGA AGGTCACCGG TACGGGCGCG CCGATCTCGG 2 00 

TGCCGGTCGG CACCGGCACG CTGGGCCGCA TCATGGACGT GCTGGGTCGT 250 

50 CCCATCGACG AAGCCGGCCC GATCCAGCAC GAAGAAAAGC GTGGCATTCA 300 

CCAGCCGGCT CCCCGTTTCG ACGAACTGTC GCCGTCGGTG GAACTGCTGG 3 50 

AAACCGGCAT CAAGGTTATT GACCTGGTCT GCCCGTTCGC CAAGGGCGGC 4 00 

AAGGTCGGCC TGTTCGGCGG CGCCGGCGTG GGCAAGACCG TCAACATGAT 450 

GGAACTGATC AACAACATCG CCAAGCAGCA CAGCGGCTTG TCGGTGTTCG 500 

55 CCGGCGTGGG CGAGCGTACC CGCGAAGGCA ACGACTTCTA CCACGAAATG 550 

GAAGAGTCGA ACGTTCTGGA CAAGGTTGCG ATGGTGTTCG GTCAGATGAA 600 

CGAACCCCCG GGCAACCGTC TGCGCGTGGC GCTGACCGGC CTGACCATGG 650 

CCGAGAAGTT CCGCGACGAA GGCCGCGACA TCCTGTTCTT CGTGGACAAC 700 

ATCTACCGCT ACACCCTGGC CGGTACGGAA GTGTCCGCGC TGCTGGGCCG 750 

50 TATGCCGTCG GCAGTGGGCT ACCAGCCCAC GCTGGCCGAA GAAATGGGCA 8 00 
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2) INFORMATION FOR SEQ ID NO: 243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: AcinetoJbacter Jbaumaixnii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243 

TGAAACTACT TTAGAAGTTC AGCAACAACT TGGTGATGGT GTTGTTCGTA 50 

CCATCGCAAT GGGTTCTACA GAAGGTCTTA AACGTGGTCT TACTGTAACT 100 

AGCACAAACG CACCGATCTC TGTTCCAGTT GGTACAGCCA CTCTTGGCCG 150 

TATCATGGAC GTTTTAGGTC GTCCTATCGA TGAAGCAGGT CCTGTTGCGA 20 0 

CTGAAGAACG TTTGCCGATT CACCGTCAAG CGCCTTCTTA TGCTGAACAA 250 

GCAGCTTCTA CTGACCTTTT AGAAACTGGT ATTAAAGTCA TCGACTTACT 3 00 

TTGCCCGTTT GCGAAAGGTG GTAAAGTTGG TTTATTCGGT GGTGCTGGTG 350 

TTGGTAAAAC CGTTAACATG ATGGAATTGA TCAACAACAT CGCGAAAGCA 4 00 

CACTCAGGTT TATCTGTGTT TGCTGGTGTT GGTGAGCGTA CTCGTGAAGG 450 

TAATGACTTC TATCACGAAA TGAAAGATTC TAACGTTCTT GACAAAGTAG 500 

CAATGGTCTA CGGTCAGATG AACGAGCCAC CAGGTAACCG TTTACGCGTA 550 

GCGTTAACTG GTTTGACTAT GGCTGAATAC TTCCGTGATG AAAAAGACGA 600 

AAACGGTAAA GGTCGTGACG TATTATTATT CGTCGACAAC ATCTACCGTT 650 

ATACACTTGC AGGTACTGAA GTATCAGCAT TGTTAGGTCG TATGCCATCT 70 0 

GCGGTAGGTT ACCAACCTAC ACTTGCAGAA GAAATGGGTG TTCTTCAAGA 750 

GCGTATTACA TCTACTAAAT CTGGTTCGAT CACTTCG 787 



2) INFORMATION FOR SEQ ID NO: 244 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobeicter Iwoffii 

(B) STRAIN: CDCF 3697 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244 

GTTCCTAAGA TCTATGACGC TCTCCACGTT GATGGCACTG AAACTACATT 50 

AGAAGTTCAG CAACAACTTG GTGATGGCGT AGTTCGTACT ATTGCAATGG 100 

GTTCTACTGA AGGCCTTAAG CGTGGTTTGA ACGTAACTAA CACTAACGCG 150 

CCGATTTCTG TACCAGTAGG TACAGCGACT CTAGGTCGTA TCATGGACGT 2 00 

TCTTGGTCGC CCAATCGACG AAGCTGGTCC AGTTGCGACT GAAGCGCGTT 250 
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TGCCGATTCA CCGTCAAGCA CCTTCTTATG CTGAACAAGC AGCTTCTACT 3 00 

GACCTTTTAG AAACTGGTAT TAAAGTCATC GACTTACTTT GCCCGTTCGC 3 50 

TAAAGGTGGT AAAGTTGGTC TGTTCGGTGG TGCCGGTGTT GGTAAAACTG 4 00 

TAAACATGAT GGAGTTGATC AACAACATCG CTAAAGCGCA CTCAGGTTTA 4 50 

5 TCTGTATTCG CTGGTGTTGG TGAGCGTACT CGTGAAGGTA ATGACTTCTA 500 

TCACGAGATG AAAGACTCAA ACGTTCTAGA CAAAGTAGCA ATGGTCTACG 550 

GTCAGATGAA CGAGCCACCG GGTAACCGTT TACGCGTAGC GTTGACTGGT 600 

TTGACCATGG CTGAGTACTT CCGTGACGAG AAAGACGAAA ACGGCAAAGG 650 

CCGTGACGTA CTATTGTTCG TAGATAACAT CTACCGTTAT ACACTAGCAG 700 

10 GTACTGAAGT ATCAGCACTT CTAGGTCGTA TGCCGTCTGC AGTAGGTTAC 750 

CAACCGACAC TTGCAGAAGA GATGGGTGTT CTTCAAGAAC GTATTACATC 8 00 

GACTAAGTCT GGTTCGATTA CGTCA 825 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 245 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 83 7 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSG-197 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245 

CAATGAAGTT CCAGAAATTA ACAATGCCTT AGTCGTAGAC GTTGAAAGAG 50 

ATGAAGGTAC AGTATCTCTT ACATTAGAAG TGGCATTACA ACTTGGCGAT 100 

GATGTCGTAC GTACAATTGC AATGGATTCT ACTGATGGTG TTAAACGTGG 150 

3 5 TACAGAAGTT CGAGATAGCG GAGATAGCAT CAGTGTTCCA GTTGGTGATG 200 

CTACGTTAGG ACGTGTGTTT AATGTTCTTG GTGATACAAT TGACTTAGAC 250 

GAGAAGCTTG ATACTTCTGT CAAACGTGAT CCAATTCATA GAGAAGCACC 3 00 

TGCATTCGAT CAATTATCAA CAAAAGTTGA AATCTTAGAA ACAGGTATTA 3 50 

AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA AATCGGTTTA 4 00 

40 TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 4 50 

TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTGCC GGCGTAGGTG 500 

AACGTACGCG TGAAGGTAAT GACTTATACT ACGAAATGAG TGATAGTGGT 550 

GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG AGCCACCTGG 600 

TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCC 650 

45 GTGATGTACA AGGACAAGAT GTTTTACTAT TTATTGATAA CATATTCAGA 700 

TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTC GTATGCCATC 750 

AGCCGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAATTACAAG 800 

AACGTATTAC ATCAACAACT AAAGGATCTG TAACGTC 837 



2) INFORMATION FOR SEQ ID NO: 246 



(i) SEQUENCE CHARACTERISTICS : 
55 (A) LENGTH: 851 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

SO (ii) MOLECULE TYPE: Genomic DNA 
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5 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis 

(B) STRAIN: ATCC 15554 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246 



TTCCCCCGCG ACAGCATCCC TAAAGTCTAC GAAGCATTGA CGCTCGTTGA 50 

CGAAAGTTCG GCTTTCGCAG AAAAAGGCCT GACTTTTGAA GTACAGCAAC 100 

10 AATTGGGTGA CGGTGTAGTT CGCACCATCG CCATGGGTTC CAGCGACGGC 150 

CTGCGCCGCG GTATGGAAGT GGCCGGTTCG GGCGCTCCCA TCTCCGTTCC 2 00 

CGTGGGTGTC GGCACCCTGG GTCGCATTAT GGACGTTCTG GGTCGCCCTA 2 50 

TTGACGAAGT CGGTCCTATT CAGTCCGACG AGCGTCGCGC CATTCACCAG 3 00 

CCTGCGCCTA CTTTCGACGA ACTGTCGCCT TCCGTAGAGC TGCTGGAAAC 3 50 

15 CGGTATTAAA GTGATTGACC TGGTTTGCCC GTTCGCCAAG GGTGGTAAGG 4 00 

TTGGTCTGTT CGGTGGTGCC GGTGTGGGCA AGACCGTGAA CATGCTGGAG 4 50 

CTGATCAACA ACATCGCCAA GGCACACAGC GGTCTGTCCG TGTTTGCCGG 50 0 

TGTGGGTGAG CGTACCCGTG AAGGTAACGA CTTCTACCAC GAAATGGCCG 550 

ATGCTGGCGT TAT C C AGATG GACAACCTGA GCGAGTCCAA AGTGGCCATG 600 

2 0 GTGTTCGGTC AGATGAACGA ACCTCCAGGC AACCGTCTGC GTGTGGCACT 650 

GTCCGGCCTG ACCATGGCCG AGAAGTTCCG TGACGAAGGC CGTGACATCC 700 

TGTTCTTTGT GGACAACATC TACCGCTACA CGCTGGCCGG TACAGAAGTG 750 

TCCGCTCTGC TGGGTCGTAT GCCTTCCGCA GTGGGTTACC AGCCTACGCT 800 

GGCCGAGGAA ATGGGTAAGC TGCAAGAGCG CATTACCTCC ACCAAGACCG 850 

25 G 851 



30 



2) INFORMATION FOR SEQ ID NO: 247 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Bacillus anthracis 

(B) STRAIN: 4229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247 

15 GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAGAG 50 

CAACGAAAAC GGAACAAGCA TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA CACAGTTCGT ACAGTTGCAA TGTCTTCCAC AGATGGACTT 150 

GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 2 00 

TGGTGATGCA ACACTTGGTC GTGTATTTAA CGTATTAGGT GATGCAATTG 2 50 

50 ACTTAGATGG TGAGGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 3 00 

CAAGCACCTG CATTCGAAGA ATTATCTACT AAAGTAGAAA TTCTTGAAAC 350 

TGGTATTAAA GTAGTAGACT TACTTGCTCC TTACATTAAG GGTGGTAAGA 4 00 

TCGGTCTATT CGGTGGTGCC GGTGTAGGTA AAACGGTATT AATTCAGGAA 4 50 

TTAATCAATA ACATCGCACA AGAACACGGT GGTATCTCTG TATTCGCTGG 500 

55 TGTAGGTGAG CGTACTCGTG AGGGTAATGA CTTATACCAC GAAATGAGCG 550 

ATTCTGGCGT AATTAAGAAA ACTGCGATGG TATTCGGACA AATGAACGAG 600 

CCACCTGGAG CACGTCAACG TGTTGCGTTA ACAGGTTTAA CAATGGCTGA 650 

GCATTTCCGT GATGAGCAAG GACAAGATGT ACTTCTGTTC ATCGATAATA 700 

TCTTCCGTTT CACGCAAGCA GGTTCTGAAG TATCTGCCCT TCTTGGCCGT 750 

50 ATGCCATCTG CGGTAGGTTA CCAACCAACA CTTGCAACAG AAATGGGTCA 800 
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ATTACAAGAG CGTATTACAT CTACAAATAA AGGGTCTATC ACGTCT 84 6 



2) INFORMATION FOR SEQ ID NO: 248 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248 

CCAGAAATCT ACAATGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

CATGAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

GAAGATACTG GTAAAGCAAT CTCTGTACCA GTTGGTGATG CAACACTTGG 2 00 

ACGTGTATTC AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAACTTC 2 50 

CTGCGGATGT ACACCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 3 00 

GAATTATCTA CTAAAGTAGA AATTCTTGAA ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 4 00 

CCGGCGTAGG TAAAACAGTA TTAATTCAGG AGTTAATCAA TAACATCGCA 4 50 

CAAGAGCACG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

TGAGGGTAAT GACTTATACC ACGAAATGAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650 

AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 7 00 

CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 750 

TACCAACCAA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATTAC 800 

ATCTACAAAT 810 



2) INFORMATION FOR SEQ ID NO: 249 

( i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 944 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ba.cteroid.es distasonis 

(B) STRAIN: ATCC 8503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249 

CTTTGATTAT GGGACAGAAG AAACAGTAAC CCTCCCCCGT ATCCACGACG 50 

CCATGGAGAT TTCCCGCCCA AACGGAAAGA TCTTGATCGT CGAAGTTCAG 100 

CAACACATCG GGGAAAACAC CGTCCGTACC GTAGCGATGG ATACGACCGA 150 

CGGATTGAGA CGAGGCATGG AGGCCGTGTC ATACGGAATG CCCATCACCA 2 00 

149 
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TGCCGACCGG CGACCAAGTC AAAGGACGTT TAATGAATGT CACCGGCGAC 250 

CCTATCGATG GCATGGCCCA GCTTACTAAA GACGGGGCTC TTCCCATCCA 3 00 

TCGTGAGCCT CCTAAATTCG AGGATCTGAC AACGACCCAA GAGGTCTTGT 3 50 

ACACGGGTAT TAAAGTAATC GATTTATTGG AACCTTACGC CAAAGGAGGC 4 00 

5 AAGATCGGAC TTTTCGGAGG AGCCGGAGTC GGCAAAACGG TATTGATCAT 4 50 

GGAATTGATC AACAACATCG CAAAGAAAAA CAACGGATTC TCCGTCTTCG 500 

CCGGTGTGGG TGAACGTACA CGTGAAGGAA ATGACCTATT GCGTGAAATG 550 

ATCCAATCCG GTGTCATCCG ATACGGCGAG GAGTTTAAGA AGAGTATGGA 600 

GGCAGGCAAC TGGGACTTGT CGAAGATCGA TTACGATGAA TTGGCTAAGT 650 

10 CGCAGGCTAC CTTGGTATTC GGGCAGATGA ACGAACCGCC GGGCGCCCGC 700 

TCGTCCGTGG CCTTATCCGG TTTGACGATC GCCGAATCCT TCCGTGACAA 750 

AGCCTCTGAG GGAGAAAGAA AAGATATATT ATTCTTCATC GATAATATCT 800 

TCCGTTTCAC CCAAGCCGGT TCCGAGGTTT CCGCCTTGCT GGGGCGTATG 850 

CCTTCCGCCG TAGGTTACCA ACCGACATTG GCCACGGAAA TGGGAGCTAT 900 

15 GCAAGAGCGT ATCACTTCAA CCAAGAAGGG CTCCATCACC TCCG 944 



20 



55 



2) INFORMATION FOR SEQ ID NO: 250 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 93 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Bacteroides ovatus 

(B) STRAIN: ATCC 84 83 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250 

35 TTGAGGGTAC GGATGCAGAA TTGGTGCTGC CAAGCATCCA CGACGCACTG 50 

GAGATAAAGA GGCCAAACGG CAAAATACTG GTTGTAGAAG TTCAGCAACA 100 

TATCGGCGAA AATACGGTGC GTACCGTAGC GATGGACAGT ACTGACGGAC 150 

TTCAGAGAGG CATGAAAGTG TATCCCACCG GAGGCCCGAT CACGATGCCG 2 00 

ATTGGCGAAC AGATTAAAGG ACGACTGATG AACGTAGTCG GTGATTCGAT 250 

£0 CGACGGTATG AAAGGACTCG ACCGCAAAGG TGCATATTCC ATTCATCGCG 3 00 

ACCCCCCTAA GTTTGAGGAT TTGACTACTG TGCAAGAGGT GCTCTTCACA 3 50 

GGTATCAAAG TGATCGACCT GCTCGAACCG TATGCCAAAG GTGGTAAAAT 40 0 

CGGTTTGTTC GGCGGTGCCG GTGTAGGAAA GACTGTATTG ATTCAGGAAC 450 

TTATCAATAA TATCGCCAAG AAACATAATG GATTCTCTGT ATTTGCCGGA 500 

45 GTAGGTGAAC GTACCCGTGA AGGTAACGAC TTGCTGCGCG AAATGATTGA 550 

ATCCGGTGTA ATCCGTTACG GCGAAGCATT CAAAGAAGGA ATGGAGAAAG 600 

GTCACTGGGA TCTTTCGAAA GTGGATTATA ACGAACTGGA GAAATCGCAA 650 

GTGTCTCTGA TTTTCGGTCA GATGAACGAG CCTCCGGGCG CACGTGCCTC 700 

TGTGGCATTG TCCGGACTGA CGGTGGCGGA ATCTTTCCGC GACGCAGGAA 750 

50 AAGAAGGTGA GAAACGCGAT ATTCTGTTCT TTATTGATAA TATCTTCCGT 80 0 

TTCACGCAAG CAGGTTCAGA AGTGTCCGCC CTTTTGGGAC GTATGCCCTC 850 

CGCTGTTGGT TACCAGCCCA CGTTGGCTAC GGAAATGGGT GCGATGCAGG 900 

AACGTATCAC GTCTACCCGC AAAGGTTCTA TCACCTCCG 93 9 



2) INFORMATION FOR SEQ ID NO: 251 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 83 3 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



PCT/CA00/01150 



10 



15 



20 



25 



30 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leclercia adecarboxylata 

(B) STRAIN: ATCC 23216 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 51 



CGAATTCCCT 
AGAATGGTAA 
GGTATTGTGC 
TCTGGAAGTT 
CAACCCTGGG 
GGCGACATCG 
CTATGAAGAG 
TTATCGACCT 
GGCGGTGCGG 
CATCGCGATC 
GTACTCGTGA 
CTGGACAAAG 
CCGTCTGCGC 
ACGAAGGTCG 
CTGGCCGGTA 
AGGCTACCAG 
TCACCTCTAC 



CAGGATGCCG 
TGAGAGCCTG 
GTACCATCGC 
AAAGACCTCG 
CCGTATCATG 
GCGAAGAAGA 
CTGTCCAGCT 
GATGTGTCCG 
GTGTAGGTAA 
GAGCACTCCG 
GGGTAACGAC 
TATCCCTGGT 
GTTGCGCTGA 
TGACGTACTG 
CGGAAGTATC 
CCGACCCTGG 
CAAAACCGGT 



TACCGCGCGT 
GTGCTGGAAG 
CATGGGTTCT 
AGCACCCAAT 
AACGTCCTGG 
GCGTTGGGCT 
CTCAGGAACT 
TTCGCGAAGG 
AACCGTAAAC 
GTTACTCCGT 
TTCTACCACG 
TTACGGCCAG 
CCGGCCTGAC 
CTGTTCGTTG 
CGCACTGCTG 
CGGAAGAGAT 
TCTATCACCT 



GTACGATGCT 
TTCAGCAGCA 
TCCGACGGTC 
CGAAGTACCA 
GTCAGCCGAT 
ATCCACCGTT 
GCTGGAAACC 
GCGGTAAAGT 
ATGATGGAGC 
GTTTGCAGGC 
AAATGACCGA 
ATGAACGAGC 
CATGGCTGAG 
ACAACATCTA 
GGTCGTATGC 
GGGTGTTCTG 
CCG 



CTTGAGGTTC 50 

GCTCGGCGGC 100 

TGCGTCGTGG 15 0 

GTAGGTAAAG 2 00 

CGACATGAAA 2 50 

CAGCACCTTC 300 

GGCATCAAAG 350 

TGGTCTGTTC 4 00 

TGATCCGTAA 450 

GTGGGTGAGC 500 

CTCCAACGTT 550 

C AC C AGGAAA 600 

AAGTTCCGTG 650 

CCGTTACACC 70 0 

CATCAGCAGT 750 

CAGGAACGTA 80 0 

833 



35 



40 



45 



50 
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2) INFORMATION FOR SEQ ID NO: 252 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stenotrophomonas maltophilia 

(B) STRAIN: CDC F3338 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252 



GTGTACGACG 
GCAGCAGCTG 
ACGGCCTGAA 
GTGCCGGTCG 
TCCGATCGAC 
ACCGCGCTGC 
GAAACCGGCA 
CAAGGTCGGC 
TGGAACTGAT 
GCCGGCGTGG 
GAAGGACTCC 
ACGAGCCGCC 



CACTGAAGGT 
GGCGACGGCG 
GCGCAACCTG 
GCGCCGGCAC 
GAAGCCGGCC 
GCCGTCGTAT 
TCAAGGTCAT 
CTGTTCGGCG 
CAACAACATC 
GTGAGCGTAC 
AACGTCCTGG 
GGGCAACCGT 



GGAAAACACC 
TGGTGCGTAC 
GTTGCCGTCA 
CCTGGGCCGC 
CGGTCGCTGC 
GAAGACCAGT 
CGACCTGATG 
GCGCCGGCGT 
GCCAAGGCGC 
CCGTGAGGGC 
ACAAGGTGGC 
CTGCGCGTCG 



GAGATCACCC 
CATCGCCCTC 
ACACCGGCCG 
ATCATGGACG 
CAGCGACAAC 
CCCCGGCCAC 
TGCCCGTTCG 
CGGCAAGACC 
ACAGCGGCCT 
AACGACTTCT 
GATGGTGTAC 
CCCTGACCGG 



TCGAAGTCCA 50 

GGTTCCACCG 100 

TGGCATCTCG 150 

TGCTGGGCCG 2 00 

TGGGAAATCC 2 50 

CGAGCTGCTG 300 

CCAAGGGCGG 3 50 

GTCAACATGA 4 00 

GTCCGTGTTC 450 

ACCACGAGAT 500 

GGCCAGATGA 550 

CCTGACCATG 600 
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GCCGAGTACT TCCGCGATGA 
CCTGCTGTTC GTCGACAACA 
TGTCGGCACT GCTGGGCCGC 
CTGGCCGAGG AAATGGGCGT 
5 TGGTTCGATC ACCTCGATC 



GAAGGACGAA AACGGCAAGG 
TCTACCGCTA CACCCTGGCC 
ATGCCGTCCG CGGTGGGTTA 
CCTGCAGGAG CGCATCACCT 



PCT/CA00/01150 

GCAAGGACGT ""650 
GGTACCGAAG 700 
CCAGCCGACC 750 
CGACCAAGAA 800 

819 



2) INFORMATION FOR SEQ ID NO: 253 



10 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 864 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Bartonella henselae 

(B) STRAIN: ATCC 4 9882 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253 

2 5 AGTTTGAAGG CCCTTTGCCA AATATTCTCA ATGCATTAGA AACAGATAAT 50 

TTGGGCAATC GGCTAGTTTT AGAAGTTGCT CAGCATTTGG GTGAAAATAC 100 

CGTGCGTACC ATTGCCATGG ATACTACCGA TGGTCTTGTC CGTGGTCAAA 150 

AAGTTTTTGA TACAGGAACA CAGATCAGTG TTCCCGTGGG AGAAGCAACA 2 00 

CTTGGTCGTA TTATGAATGT GATTGGAGAG CCGGTTGATA ATGTTGGCCC 250 

3 0 AATTGCTACA AGCAAAACCC GTTCCATTCA CCAAGAGGCT CCTGAATATG 3 00 

TGGAGCAATC AACCGCATCA GAAATCCTTG TGACTGGTAT TAAAGTCGTT 3 50 

GATCTGTTAG CTCCTTATTC TAAAGGGGGG AAGGTTGGTT TGTTTGGAGG 4 00 

TGCCGGTGTT GGTAAAACCG TTCTCATTAT GGAGCTTATC AACAATATTG 4 50 

CAAAGGCGCA TGGTGGCTAT TCAGTGTTTG CCGGTGTTGG TGAACGTACA 500 

3 5 CGTGAGGGAA ATGATCTTTA TTATGAAATG ATCGAAAGCC GTGTGAATGT 550 

GAATCCAAAA GACAACAATG GTTCAACAGA AGGATCAAAA TGTGCACTCG 600 

TTTATGGGCA AATGAATGAA CCACCAGGGG CGCGTGCACG TGTGGCTCTT 650 

TCAGGATTGA CCATTGCAGA AAGTTTCCGT GATGAGGGAC AAGATGTTTT 700 

GTTCTTCGTA GATAATATTT TCCGTTTTAC GCAAGCAGGC GCTGAAGTGT 750 

4 0 CAGCTCTTTT AGGGCGTATT CCTTCTGCTG TAGGGTATCA GCCAACTTTG 800 

GCAACTGATA TGGGGGCTTT GCAAGAGCGT ATTACCAGTA CAAGAACAGG 850 
CTCTATTACC TCTG 864 



45 
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2) INFORMATION FOR SEQ ID NO: 254 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 866 bases 

(B) TYPE: Nucleic acid 

. (C) STRANDEDNESS : Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bifidobacterium adolescent is 

(B) STRAIN: ATCC 15703 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254 
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PCT/CA00/01150 



TTCCCGGTGG GCCACCTGCC CGACATTTAC AATGCACTTA CCGTTGAACT 50 

GKCCAACACC GGYGTCCACG AASAGGGCGA GACCACCAAG AAGATCACCC 100 

TTGMGGTTGA ACAGCATCTT GGCGATTCCA CCGTGCGTAC CGTCGCTCTG 150 

5 AAGCCGACTG ACGGCCTTGT GCGTGGCGCC ACCGTGTATG ACACCGGCGG 200 

CCCGATCTCT GTGCCGGTTG GCGATGTCAC CAAGGGCCAC GTATTCGACG 250 

TGTCCGGCAA CATCCTCAAC AAGAAGGCCG ACGAGACCGT TAAGGTTACC 3 00 

GAACGTTGGT CTATCCACCG TAACCCGCCG GCATTCGACC AGCTGGAGTC 3 50 

CAAGACCCAG ATGTTCGAAA CCGGTATCAA GGTCATCGAT TTGCTGACCC 4 00 

10 CGTATGTGCA GGGCGGCAAG ATCGGTCTGT TCGGCGGCGC AGGCGTCGGC 450 

AAGACCGTGC TGATCCAGGA AATGATTCAG CGTGTGGCTC AGAACCACGG 500 

CGGTGTGTCC GTGTTCGCAG GCGTCGGCGA GCGTACCCGT GAGGGTAACG 550 

ATCTGATCGG CGAAATGGAC GAAGCCGGCG TGCTCGAGAA GACCGCACTG 600 

GTCTTCGGCC AGATGGATGA GCAGCCGGGT ACCCGTCTGC GCGTGCCGCT 650 

15 GACCGCACTG AC CATGGC AG AGTACTTCCG TGACGTACAG AATCAGGACG 700 

TGCTGCTGTT CATCGATAAC ATCTTCCGTT TCACCCAGGC TGGTTCCGAG 750 

GTGTCCACCC TGCTCGGCCG TATGCCGTCC GCAGTGGGCT ACCAGCCGAA 800 

CCTGGCCGAT GAGATGGGCG CGCTGCAGGA GCGAATCACT TCGACCCGTG 850 

GACACTCCAT CACCTC 866 

20 

2) INFORMATION FOR SEQ ID NO: 255 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brucella, abortus 
35 (B) STRAIN: S2308 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 

AAGGCCAGCT GCCGCTGATC CTGAACGCGC TTGAAGTGGA CAATCAGGGC 50 

4 0 CATCGTCTGG TTCTCGAAGT TGCCCAGCAC CTCGGCGAAG ACACCGTGCG 100 

CACCATCGCC ATGGACGCGA CCGAAGGTCT CGTTCGCGGT CAGGAAGCAC 150 

GCGACACTGG CGAACCGATC ATGGTGCCGG TCGGCGTCGA AACGCTTGGC 2 00 

CGCATCATGA ACGTCATCGG CGAGCCGGTT GACGAAGCAG GCCCCATCAA 2 50 

GACCAAGGCA ACCCGCGCCA TCCACCAGAA CGCGCCGGAA TATATCGAAC 300 

45 AGTCGACCGA AGCCGAAATT CTGGTCACGG GCATCAAGGT CGTCGACCTT 350 

CTGGCGCCTT ACGCCAAGGG CGGCAAGATC GGCCTCTTCG GCGGTGCAGG 400 

CGTCGGCAAG ACCGTTCTCA TCATGGAACT CATCAACAAC GTCGCCAAGG 450 

CGCACGGCGG TTATTCCGTG TTCGCAGGCG TCGGTGAGCG TACCCGTGAG 500 

GGCAACGACC TTTACCACGA AATGATCGAG TCGGGCGTGA ACAAGCTCGG 550 

50 CGGCGGCGAA GGCTCCAAGG CAGCCCTCGT TTACGGCCAG ATGAACGAAC 600 

CCCCGGGTGC CCGCGCCCGC GTTGCCCTTT CCGGTCTGAC GGTTGCTGAA 650 

AACTTCCGTG ACCAGGGCCA GGACGTTCTG TTCTTCGTGG ACAACATCTT 700 

CCGCTTCACG CAGGCAGGTT CGGAAGTGTC GGCTCTTCTC GGCCGTATTC 750 

CTTCCGCTGT GGGTTATCAG CCGACGCTGG CAACCGACAT GGGCGCCATG 800 

55 CAGGAACGCA TCACCACGAC GACCAAGGGT TCGATCACCT CG 842 
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2) INFORMATION FOR SEQ ID NO: 256 

153 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Cedecea davisae 

(B) STRAIN: ATCC 33431 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256 

15 CGAGTTCCCT CAGGACGGCG TACCGCGCGT TKATGATGCG CTTGAAGTAC 50 

AAAATAACAG CGAGCAGCTG GTGCTGGAAG TTCAGCAGCA GCTCGGCGGC 100 

GGTATCGTGC GTACCATCGC TATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

TCTGGAAGTT AAAGACCTCG AGCACCCGAT CGAAGTCCCG GTAGGTAAAG 2 00 

CGACCCTTGG CCGTATCATG AACGTGCTGG GTCAGCCGAT CGACATGAAA 250 

20 GGCGATATCG GCGAAGAAGA CCGTTGGGCT ATTCACCGCG CTGCACCTTC 300 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 350 

TAATCGACCT TATCTGTCCG TTCGCTAAGG GCGGTAAAGT AGGTCTGTTC 400 

GGTGGTGCGG GCGTGGGTAA AACCGTAAAC ATGATGGAGC TTATCCGTAA 450 

CATCGCGATC GAGCACTCCG GCTACTCCGT GTTTGCGGGC GTGGGTGAGC 500 

2 5 GTACTCGTGA GGGTAACGAC TTCTATCACG AAATGACCGA CTCCAACGTT 550 

CTGGACAAAG TTGCCCTGGT TTACGGCCAG ATGAACGAGC CACCGGGTAA 600 

CCGTCTGCGC GTAGCGCTGA CCGGTCTGAC CATCGCGGAG AAATTCCGTG 650 

ACGAAGGTCG TGACGTTCTG CTGTTCGTTG ATAACATCTA CCGTTACACC 7 00 

CTGGCCGGTA CTGAAGTATC CGCGCTGCTG GGTCGTATGC CTTCTGCGGT 750 

3 0 AGGTTACCAG CCAACTCTGG CGGAAGAGAT GGGTGTTCTT CAGGAGCGTA 800 

TTACCTCCAC CAAGACCGGT TCCATCACCT CCG 83 3 



35 2) INFORMATION FOR SEQ ID NO: 257 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 
a0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea lapagei 

(B) STRAIN: ATCC 33432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257 

50 

TTCCCTCAGG ACGGCGTACC GCGCGTATAT GACGCGCTTG AGGTACAGAA 50 

TAACAGCGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTACGTAC CATCGCAATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GAAGTGAAAG ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 2 00 

55 TCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCAATTGAT ATGAAAGGCG 250 

ACATCGGCGA AGAAGATCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAT 3 00 

GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 350 

CGACCTGATT TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGCGT AGGT AAAAC C GTAAACATGA TGGAGCTGAT CCGTAACATC 4 50 

50 GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCAGGCGTGG GTGAGCGTAC 500 
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TCGTGAGGGT AACGACTTCT ACCACGAGAT 
ACAAAGTTGC ACTGGTTTAC GGCCAGATGA 
CTGCGCGTAG CGCTGACCGG TCTGACCATC 
AGGCCGTGAC GTTCTGCTGT TCGTCGATAA 
CCGGTACAGA AGTTTCTGCA CTGCTGGGTC 
TATCAGCCTA CTCTGGCAGA AGAGATGGGT 
CTCCACCAAG ACCGGTTCCA TCACTTCCG 



PCT/CA00/01150 

GACCGACTCC AACGTTCTGG 550 

ACGAGCCGCC AGGTAACCGT 600 

GCGGAGAAAT TCCGTGACGA 650 

CATCTATCGT TATACCCTGG 700 

GTATGCCATC TGCGGTAGGT 750 

GTTCTTCAGG AGCGTATTAC 8 00 

829 



2) INFORMATION FOR SEQ ID NO: 258 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cedecea neteri 

(B) STRAIN: ATCC 33855 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258 



TCCCTCAGGA CGGCGTACCG CGCGTTTATG 
AACAATGAGA AGCTGGTGCT GGAAGTTCAG 
CGTGCGTACC ATCGCAATGG GTTCTTCCGA 
TAGTAACAGA CCTCGAGCAC CCGATCGAAG 
CTTGGCCGTA TCATGAACGT GCTGGGTCAG 
CATCGGCGAA GAAGACCGTT GGGCGATTCA 
AAGAGCTGTC CAGCTCTCAG GAATTGCTGG 
GACCTGATTT GTCCGTTCGC TAAGGGCGGT 
TGCGGGCGTA GGTAAAACCG TAAACATGAT 
CGATCGAGCA CTC CGGTTAT TCCGTGTTTG 
CGTGAGGGTA ACGACTTCTA CCACGAAATG 
TAAAGTAGCA CTGGTTTACG GCCAGATGAA 
TGCGCGTAGC GCTGACCGGT CTGACCATCG 
GGCCGCGACG TTTTGCTGTT CGTTGATAAC 
CGGTACCGAA GTATCTGCAC TGCTGGGTCG 
ATCAGCCAAC TCTGGCAGAA GAGATGGGTG 
TCCACCAAGA CCGGTTCTAT CACCTCCGTA 



ACGCGCTTGA GGTACAGAAC 50 

CAGCAGCTCG GCGGCGGTAT 100 

CGGTCTGCGT CGTGGTCTGG 150 

TCCCGGTAGG TAAAGCGACC 200 

CCGATCGACA TGAAAGGCGA 2 50 

CCGCGCAGCA CCTTCCTACG 300 

AAACCGGCAT CAAAGTTATC 350 

AAAGTAGGTC TGTTCGGTGG 4 00 

GGAGCTGATC CGTAACATTG 450 

CGGGCGTGGG TGAGCGTACT 500 

ACCGACTCCA ACGTTCTGGA 550 

CGAGCCACCA GGTAACCGTC 6 00 

CGGAAAAATT CCGTGACGAA 650 

ATCTACCGTT ATACCCTGGC 700 

TATGCCTTCT GCGGTAGGTT 750 

TTCTTCAGGA GCGTATTACC 800 

830 



2) INFORMATION FOR SEQ ID NO : 259 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: ChxrysGobacterium meningosepticum 

(B) STRAIN: CDC B7681 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259 

TTTCAGGAAG TAGAGGAACT ACCAAACATT TATGACGCAC TTGAAGTTGT 50 

CAGAGAAGGC CAGAAAGGTC TGATCTTAGA AGTTGAACAA CACATCGGTG 100 

5 AGGATACAGT GAGATGTATC GCTATGGATG CAACAGACGG ACTTCAAAGA 150 

GGACAATCTG TAATAGGACA TGGAAGACAA ATTACTATGC CTATTGGTGA 2 00 

AGAAGTAAAT GGTAGATTGT TCAACGTGGT AGGAGATGCT ATCGACGGAC 2 50 

TTCAGGAATT ATCTAAAGAC AATGGTTTGC CAATCCACAG AGAAGCACCT 3 00 

AAATTCGATC AGTTATCAAC TTCTGCTGAA GTACTATATA CAGGTATCAA 3 50 

10 AGTAATCGAC CTTATCGAGC CTTATGCAAA AGGTGGTAAA ATTGGTTTGT 4 00 

TCGGTGGTGC TGGTGTAGGT AAAACAGTAT TGATCCAGGA ATTGATTAAT 450 

AACATTGCTA AAGGACACGG TGGTCTTTCT GTATTCGCAG GAGTAGGTGA 500 

GAGAACAAGA GAAGGAAATG ACCTTCTTCG TGAGATGTTA GAGTCTGGTA 550 

TTATTAAATA TGGTGACGAA TTCATGCATT CTATGGAGAA CGGTGGATGG 600 

15 GATCTTTCTA AAGTTGACAG TGAGTTGATG AAAGAGTCTA AAGCTGCTTT 650 

CGTTTTCGGA CAGATGAACG AGCCACCAGG TGCAAGAGCA CGTGTAGCCC 700 

TTTCTGGTCT TACTTTAGCT GAATACTACC GTGATGGTGG CGAAAGCGGA 750 

CAAGGTAGAG ACGTTCTTTT CTTCGTAGAC AACATCTTCC GTTTTACACA 800 

GGCTGGTTCT GAGGTGTCTG CACTTCTAGG TCGTATGCCT TCAGCGGTAG 850 

2 0 GTTACCAACC AACTCTAGCT TCTGAGATGG GTGCAATGCA GGAGAGAATT 900 

ACTTCAACTA AAAACGGATC TATTACATCT G 931 



2 5 2) INFORMATION FOR SEQ ID NO: 2 60 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 72 6 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci trobacter amalonaticus 

(B) STRAIN: ATCC 25405 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260 



GTATCGTACG TACCATCGCA ATGGGTTCTT CCGACGGTCT GCGTCGTGGT 50 

CTGGAAGTCA AAGACCTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC 100 

AACCCTGGGT CGTATCATGA ACGTCCTGGG TCACCCGATC GACATGAAAG 150 

GCGATATCGG TGAAGAAGAG CGTTGGGCTA TCCACCGCGC AGCACCGTCC 2 00 

45 TATGAAGAGC TGTCCAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT 2 50 

TATCGACCTG ATGTGTCCGT TCGCGAASGG CGGTAAAGTG GGTCTGTTCG 300 

GTGGTGCGGG TGTAGGTAAA ACCGTAAACA TGATGGAGCT TATCCGTAAC 3 50 

ATCGCGATCG AGCACTCCGG TTACTCCGTG TTTGCGGGCG TAGGTGAACG 4 00 

TACTCGTGAG GGTAACGACT TCTACCACGA AATGACCGAC TCCAACGTTC 4 50 

50 TGGATAAAGT ATCCCTGGTA TATGGCCAGA TGAACGAGCC GCCGGGAAAC 500 

CGTCTGCGCG TTGCACTGAC CGGTCTGACC ATGGCAGAGA AATTCCGTGA 550 

CGAAGGTCGT GACGTACTGC TGTTCGTCGA TAACATCTAT CGTTACACCC 600 

TGGCCGGTAC GGAAGTTTCC GCACTGCTGG GTCGTATGCC ATCAGCGGTA 650 

GGTTACCAGC CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT 700 

55 CACTTCTACC AAAACCGGTT CTATCA 726 



50 



2) INFORMATION FOR SEQ ID NO: 2 61 

156 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci trobacter braakii 

(B) STRAIN: ATCC 43162 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAT 150 

CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACTC TGGGTCGTAT 200 

CATGAACGTC CTGGGTCACC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 2 50 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTATGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 450 

TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAT AAAGTATCCC 550 

TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCT 600 

CTGACCGGTC TGACCATGGC AGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACAGAAG 7 00 

TATCCGCTCT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCGACC 750 

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

CGGTTCTATC AC 812 



2) INFORMATION FOR SEQ ID NO: 2 62 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci troJbacter koseri 

(B) STRAIN: ATCC 27156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262 

GCGCGTGTAC GACGCCCTTG AGGTGCAGAA TGGTAATGAA CATCTGGTGC 50 

TGGAAGTTCA GCAGCAGCTC GGTGGCGGTA TCGTACGTAC CATCGCCATG 100 

GGTTCTTCCG ACGGCCTGCG TCGTGGTCTG GATGTGAAAG ACCTTGAGCA 150 

CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT ATCATGAACG 200 

TACTGGGCGA ACCAGTAGAC ATGAAAGGCG AGATCGGTGA AGAAGAGCGT 250 

TGGGCTATCC ACCGTGCGGC ACCGTCCTAC GAAGAGTTGT CAAACTCTCA 3 00 

GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG TGCCCGTTCG 3 50 

CGAAGGGCGG TAAAGTGGGT CTGTTCGGTG GTGCGGGTGT AGGTAAAACC 4 00 

GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAAC ACTCCGGTTA 4 50 

CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT AACGACTTCT 500 

157 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



ACCACGAAAT GACCGACTCC 
GGCCAGATGA ACGAGCCGCC 
CCTGACCATG GCGGAGAAAT 
TCGTCGACAA CATCTACCGT 
CTGCTGGGTC GTATGCCTTC 
AGAGATGGGT GTTTTGCAGG 
TCACCTCCGT A 



AACGTTATCG ACAAAGTATC CCTGGTTTAC 
GGGAAACCGT CTGCGCGTTG CGCTGACCGG 
TCCGTGACGA AGGTCGTGAC GTTCTGCTGT 
TACACCCTGG CCGGTACGGA AGTATCCGCA 
AGCGGTAGGT TACCAGCCGA CCCTGGCGGA 
AACGTATCAC CTCCACCAAA ACCGGTTCTA 



550 
600 
650 
700 
750 
800 
811 



2) INFORMATION FOR SEQ ID NO: 263 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter farmer i 

(B) STRAIN: ATCC 51112 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263 

GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTTATGA ATGGTAAAGA 50 

GAGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT ATCGTACGTA 10 0 

CCATCGCGAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GGAAGTAAAA 150 

GACCTCGAAC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CTCTGGGTCG 2 00 

TATCATGAAC GTCCTGGGTC ACCCGATCGA CATGAAAGGC GATATCGGTG 250 

AAGAAGAGCG TTGGGCTATC CACCGCGCAG CGCCATCCTA TGAAGAGCTG 3 00 

TCCAGCTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 3 50 

GTGTCCGTTT GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TAGGTAAAAC CGTAAACATG ATGGAACTCA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT AC T C CGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGATTC CAACGTTCTG GATAAAGTAT 550 

CCCTGGTATA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCGCTGACCG GTCTGACCAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTACTGCTG TTCGTCGATA ACATCTATCG TTACACCCTG GCCGGTACGG 70 0 

AAGTATCCGC ACTGCTGGGC CGTATGCCAT CAGCGGTAGG CTACCAGCCA 75 0 

ACCCTGGCGG AAGAGATGGG TGTTCTGCAG GAACGTATCA CTTCTACCAA 8 00 

AACCGGTTCT ATTACC 816 



2 ) INFORMATION FOR SEQ ID NO: 264 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrojbacter freundii 

(B) STRAIN: ATCC 8090 



158 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264 



PCT/CA00/01150 



AGGATGCCGT ACCGCGCGTG TACGATGCTC TTGAGGTTAT GAATGGTAAA 50 

GAGAGCCTGG TGCTGGAAGT TCAGCAGCAG CTCGGCGGCG GTATCGTACG 100 

5 TACCATCGCC ATGGGTTCTT CTGACGGTCT GCGTCGTGGT CTGGAAGTTA 150 

AAGACCTCGA GCACCCGATC GAAGTCCCGG TAGGTAAAGC AACGCTGGGT 2 00 

CGTATCATGA ACGTTCTGGG TCACCCGATC GACATGAAAG GCGATATCGG 250 

TGAAGAAGAG CGTTGGGCTA TCCACCGTGC AGCACCTTCC TACGAAGAGC 3 00 

TGTCAAGCTC TCAGGAACTG CTGGAAACCG GTATCAAAGT TATCGACCTG 350 

10 ATGTGTCCGT TCGCTAAGGG CGGTAAAGTT GGTCTGTTCG GTGGTGCGGG 400 

TGTAGGTAAA ACCGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG 450 

AACACTCCGG TTACTCCGTG TTTGCGGGCG TAGGTGAACG TACTCGTGAG 500 

GGTAACGACT TCTACCACGA AATGACCGAC TCCAACGTTC TGGACAAAGT 550 

ATCCCTGGTA TATGGCCAGA TGAACGAGCC GCCTGGAAAC CGTCTGCGTG 600 

15 TTGCGCTGAC CGGTCTGACC ATGGCTGAGA AGTTCCGTGA CGAAGGTCGT 650 

GACGTTCTGC TGTTCGTTGA TAACATCTAT CGTTACACCC TGGCCGGTAC 700 

AGAAGTATCT GCACTGCTGG GTCGTATGCC ATCAGCGGTA GGCTACCAGC 750 

CGACCCTGGC GGAAGAGATG GGTGTTCTGC AGGAACGTAT CACCTCCACC 8 00 

AAAACCGGTT CTATCACCT 819 

20 

2) INFORMATION FOR SEQ ID NO: 265 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ci trobacter koseri 
35 (B) STRAIN: ATCC 27028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265 

GATGCCGTAC CGCGCGTGTA CGACGCCCTT GAGGTGCAGA ATGGTAATGA 50 

4 0 ACATCTGGTG CTGGAAGTTC AGCAGCAGCT CGGTGGCGGT ATCGTACGTA 100 

CCATCGCCAT GGGTTCTTCC GACGGCCTGC GTCGTGGTCT GGATGTGAAA 150 

GACCTTGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCAA CGCTGGGTCG 2 00 

TATCATGAAC GTACTGGGCG AACCAGTAGA CATGAAAGGC GAGATCGGTG 2 50 

AAGAAGAGCG TTGGGCTATC CACCGTGCGG CACCGTCCTA CGAAGAGTTG 300 

45 TCAAACTCTC AGGAACTGTT GGAAACCGGT ATCAAAGTTA TCGACCTGAT 3 50 

GTGTCCGTTC GCGAAGGGCG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 4 00 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAA 450 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTA GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC GACAAAGTAT 550 

50 CCTTGGTTTA CGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCGCTGACCG GCCTGACCAT GGCGGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACGG 700 

AAGTATCCGC ACTGCTGGGT CGTATGCCTT CAGCGGTAGG TTACCAGCCG 750 

ACCCTGGCGG AAGAGATGGG TGTTTTGCAG GAACGTATCA CCTCCACCAA 80 0 

55 AACCGGTTCT ATCACCTCCG TA 822 



60 



2) INFORMATION FOR SEQ ID NO: 266 

159 
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WO 01/23604 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



PCT/CAOO/01150 



10 



15 



20 



25 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter sedlakii 

(B) STRAIN: ATCC 51115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266 



GATGCCGTAC 
GCGTCTGGTG 
CCATCGCCAT 
GACCTTGAGC 
TATCATGAAC 
AAGAAGAGCG 
TCTAACTCTC 
GTGTCCGTTC 
TAGGTAAAAC 
CACTCCGGTT 
TAACGACTTC 
CCCTGGTGTA 
GCACTGACCG 
CGTTCTGCTG 
AAGTATCCGC 
ACTCTGGCGG 
AACCGGTTCT 



CGCGCGTGTA 
CTGGAAGTTC 
GGGTTCTTCC 
ACCCGATCGA 
GTACTGGGCG 
TTGGGCTATC 
AGGAACTGCT 
GCGAAGGGCG 
CGTAAACATG 
ACTCTGTGTT 
TACCACGAAA 
CGGCCAGATG 
GTCTGACCAT 
TTCGTCGATA 
ACTGCTGGGT 
AAGAGATGGG 
ATCACCTCCG 



CGACGCCCTT 
AGCAGCAGCT 
GACGGTCTGC 
AGTCCCGGTA 
AACCAGTAGA 
CACCGTGCCG 
GGAAACCGGC 
GTAAAGTCGG 
ATGGAGCTGA 
TGCGGGCGTG 
TGACCGACTC 
AACGAGCCGC 
GGCTGAGAAG 
ACATCTATCG 
CGTATGCCTT 
TGTTCTGCAG 



GAGGTACAGA 
CGGTGGCGGT 
GTCGTGGTCT 
GGTAAAGCAA 
CATGAAAGGC 
CGCCGTCCTA 
ATCAAAGTTA 
TCTGTTCGGT 
TCCGTAACAT 
GGTGAACGTA 
CAACGTTATC 
CTGGAAACCG 
TTCCGTGACG 
TTACACCCTG 
CAGCGGTAGG 
GAACGTATCA 



ATGGTAATGA 50 

ATTGTACGTA 100 

GGAAGTAAAA 150 

CGCTGGGTCG 200 

GACATCGGTG 250 

TGAAGAGTTG 3 00 

TCGACCTGAT 350 

GGTGCGGGCG 4 00 

CGCGATCGAG 450 

CTCGTGAGGG 50 0 

GACAAAGTAT 550 

TCTGCGCGTC 600 

AAGGTCGTGA 650 

GCCGGTACGG 700 

TTATCAGCCG 750 

CCTCAACCAA 800 

820 



35 2) INFORMATION FOR SEQ ID NO: 267 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter werkmanii 

(B) STRAIN: ATCC 51114 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267 



GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACCC TGGGTCGTAT 2 00 

55 CATGAACGTC CTGGGTCATC CGATCGACAT GAAAGGCGAT ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCAGCAC CTACCTATGA AGAACTGTCC 3 00 

AGTTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAACAC 4 50 

50 TCCGGTTACT CAGTGTTTGC GGGCGTTGGT GAACGTACTC GTGAGGGTAA 500 

160 
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CGACTTCTAC CACGAAATGA 
TGGTATATGG CCAGATGAAC 
CTGACCGGTC TGACCATGGC 
TCTGCTGTTC GTTGATAACA 
TATCTGCACT GCTGGGTCGT 
CTGGCGGAAG AGATGGGTGT 
CGGTTC 



CCGACTCCAA CGTTCTGGAC AAAGTATCCC 

GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 

TGAGAAGTTC CGTGACGAAG GTCGTGACGT 

TCTATCGTTA CACCCTGGCC GGTACTGAAG 

ATGCCATCAG CGGTAGGCTA CCAGCCAACC 

TCTGCAGGAA CGTATCACCT CTACCAAAAC 



550 
600 
650 
700 
750 
800 
806 



2) INFORMATION FOR SEQ ID NO: 268 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter youngae 

(B) STRAIN: ATCC 29935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTTATGAATG GTAAAGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTCGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 2 00 

CATGAACGTT CTGGGTCACC CGATCGACAT GAAAGGCGAT ATCGGAGAAG 2 50 

AAGAGCGTTG GGCTATTCAC CGCGCAGCAC CTTCCTATGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CTGTGTTTGC GGGTGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGATTCCAA CGTTCTGGAT AAAGTATCCC 550 

TGGTTTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA TACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA CCAGCCGACC 750 

CTGGCGGAAG AGATGGGTGT TCTGCAGGAA CGTATCACTT CTACCAAAAC 800 

CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 269 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 



161 
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(xi) SEQUENCE DESCRIPTION: 



SEQ ID NO: 2 69 



10 



15 



20 



TTGAGAACGG 
AAAGACAGTG 
ACGTGTCCGC 
TGGAAGCCAT 
ATTCTGGGAA 
ACCTGTAGGA 
TTGAGGAGCA 
ATTGACCTGT 
TGGTGCGGGA 
TCGCCAAGGA 
ACCCGTGAAG 
TGATAAGACC 
GAATGCGTGT 
CACGACCATC 
CCAGGCGGGA 
TAGGCTATCA 
ATTACATCCA 



AGATTTGCCG 
AATCTCTGAT 
TGTATCGCTA 
TGATACAGGA 
GAATGTTCAA 
ACGGATAACA 
GCAGACATCC 
TATGTCCATA 
GTAGGTAAAA 
ACATGGTGGA 
GAAACGACAT 
GTACTGGTTT 
CGGTCTGACC 
AGGATGTATT 
AGTGAAGTAA 
GCCGACACTT 
CGAAGGATGG 



CAGCTATTGA 
TGTCGAAGTT 
TGGGCGGTAC 
TCCGCAATCC 
TGTCCTCGGA 
CACTGCCGAT 
GCAGAAATGC 
TTCCAAGGGT 
CCGTACTGAT 
ATGTCCGTCG 
GTATCATGAA 
ACGGACAGAT 
GGGCTGACGA 
GCTGTTTATT 
GTGCCCTGCT 
GCGACAGAAA 
TTCCATT 



CCGCTATTGA 
GCTCAGCATA 
AGATGGTCTG 
GTGTACCGGT 
CGTGAAATTG 
CCACAGACAG 
TGGAAACAGG 
GGTAAGATTG 
TCAGGAGCTG 
TTACCGGTGT 
ATGAAGGACA 
GAATGAATCA 
TGGCGGAATA 
GATAATATTT 
GGGACGTATG 
TGGGACAGCT 



PCT/CA00/01150 



AATTCCTCTG 50 

TCGGTGATGA 100 

GTTCGTGGAA 150 

GGGAAAAGAA 2 00 

ATGGTCTGGG 2 50 

GCACCGGGCT 3 00 

AATTAAGGTC 3 50 

GTTTGTTTGG 400 

ATTCATAATA 450 

AGGGGAGAGA 500 

GCGGTGTCCT 550 

CCGGGTGCCA 600 

TTTCCGTGAT 650 

TCCGTTTTAC 70 0 

CCAAGTGCAG 750 

GCAGGAGCGC 800 

827 



25 



30 



35 



*0 



45 



2) INFORMATION FOR SEQ ID NO: 270 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270 



55 



GCAAATGATG 
CGATGGAAAA 
TAGTTAGAAC 
GAAGCTGTTG 
ATTAGGAAGA 
AAGTTAATAA 
GAAGAACAAT 
CGACCTTTTA 
GTGCAGGTGT 
GCTAAAGAGC 
AAGAGAAGGT 
AAAATACAGC 
ATGAGAGTTG 
AGGTCAAGAC 
CTGGATCAGA 
TACCAACCAA 
ATCAACTACC 



AACTTCCTAA 
ATCTTAGTTT 
TATAGCTATG 
ATACAGGAGC 
ATATTTAACG 
CGAAGAAAAA 
CAGTTGTTCC 
GCACCTTACC 
TGGTAAAACA 
ACGGTGGACT 
AATGACCTTT 
ATTAGTATTT 
CTTTAACAGG 
GTGTTATTAT 
GGTTTCAGCT 
CTCTTGCTAC 
CATGGATCAA 



TATATTTAAT 
GTGAGGTAGA 
GAAGCTACTG 
ACCTATATCA 
TATTAGGTAA 
TATCCAATTC 
TCAAATGTTT 
AAAGAGGGGG 
GTTCTTATCC 
TTCTGTATTC 
ACTATGAAAT 
GGACAAATGA 
ACTTACTATG 
TCATAGATAA 
TTATTAGGAA 
AGAGATGGGA 
TTACATCAG 



GCAATACACA 
GCAACACGTA 
AAGGACTAAG 
GTACCAGTTG 
ACCACTAGAT 
ATAGACCAGC 
GAGACAGGAA 
AAAAATCGGT 
AAGAGCTTAT 
ACAGGAGTTG 
GATGGAATCA 
ACGAACCACC 
GCTGAGTACT 
CATATTCAGA 
GAATACCATC 
GCTCTTCAAG 



TAAAAATGGA 50 

GGAGACGATA 100 

AAGAGGTGTA 150 

GTGAATGCGT 2 00 

AGTGGAGCTG 2 50 

TCCATCATTT 300 

TAAAGGTTAT 3 50 

CTATTTGGAG 400 

AAACAACATA 450 

GAGAAAGATC 500 

GGAGTTATAA 550 

TGGAGCAAGA 600 

TCAGAGACCA 650 

TTCTCACAAG 700 

AGCTGTTGGT 750 

AGAGAATCAC 800 

829 



2) INFORMATION FOR SEQ ID NO: 271 

162 



SUBSTITUTE SHEET (RULE 26) 



10 



30 



WO 01/23604 PCT/CA00/01150 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 71 
TTGTCCACGT TGGATRTCTT CA 22 



15 2) INFORMATION FOR SEQ ID NO: 272 

( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272 



CCCGCGTGGC GAGCTGCCGG CACTGTACAA CGCGTTGACT GTCGAGGTCA 50 

CCCTCGAGGC AGTCGCTAAG ACCATTACCC TTGAGGTTGC CCAGCACTTG 100 

GGCGACAACC TCGTTCGCGC CGTGTCCATG GCCCCTACCG ACGGCCTCGT 150 

CCGTGGTGCT GTTGTGACCG ACTCGGGCAA GCCAATCTCC GTGCCAGTTG 2 00 

3 5 GCGACGTTGT TAAAGGCCAC GTTTTCAACG CACTGGGCGA TTGCTTGGAT 250 

GAGCCAGGTC TCGGCCGCGA TGGTGAGCAG TGGGG AATTC ACCGCGATCC 3 00 

ACCACCATTC GATCAGCTCG AAGGTAAGAC CGAAATCCTC GAGACCGGTA 3 50 

TTAAGGTCAT CGACTTGCTC ACCCCTTACG TTAAGGGCGG CAAGATTGGT 4 00 

CTGTTCGGTG GTGCAGGTGT GGGTAAGACC GTGCTCATCC AGGAGATGAT 450 

4 0 CACTCGTATT GCTCGCGAGT TCTCCGGTAC CTCCGTCTTC GCTGGCGTTG 500 

GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTCGAAAT GGAAGAAATG 550 

GGCGTTCTTC AGGACACCGC TCTCGTGTTC GGCCAGATGG ACGAGCCACC 600 

AGGAGTCCGT ATGCGCGTTG CTCTGTCCGG TCTGACCATG GCGGAGTACT 650 

TCCGCGATGT TCAGCACCAG GACGTGCTTC TGTTCATCGA TAACATTTTC 700 

4 5 CGTTTCACCC AGGCCGGTTC CGAGGTTTCG ACCCTTCTTG GTCGTATGCC 750 

TTCCGCCGTG GGTTACCAGC CAACCTTGGC TGACGAGATG GGTGTTCTCC 800 

AGGAGCGTAT TACCTCTA 818 



50 



2) INFORMATION FOR SEQ ID NO: 273 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 83 3 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

SO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium pseudodiphtheritlcum 

(B) STRAIN: ATCC 10700 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273 

CGATGCCTGC TCTGTACAAC GCGCTGACTG TCGAGGTCAC CCTCGAGGCA 50 

GTCGCCAAGA CCATCACGCT TGAGGTTGCA CAGCACCTCG GCGATAACCT 100 

GATCCGGACC ATTGCGTTGG CACCTACGGA CGGTCTCGTC CGTGGCGCTG 150 

AGGTTATCGA CACTGGTAAG CCAATTACTG TTCCCGTCGG CGATGCCGTC 2 00 

AAAGGACACG TCTTCAATGC GCTCGGTGAG TGTTTGGACG AACCAGGATT 250 

GGGCCGCGAC GGCGAACAGT GGGGAATCCA CCGCGATCCG CCACCATTCG 3 00 

ATGCGCTGGA GGGCAAAACC GAGATTCTGG AGACTGGAAT CAAGGTTATC 350 

GACCTCCTTA CCCCTTACGT TAAGGGTGGC AAAATTGGTC TGTTCGGTGG 4 00 

CGCCGGCGTC GGCAAGACCG TTCTTATCCA GGAAATGATC ACTCGTATCG 450 

CTCGTAACTT CTCCGGTACT TCCGTGTTCG CCGGCGTCGG TGAGCGTACC 500 

CGTGAGGGTA CTGACCTGTT CCTGGAAATG GAAGAGATGG GCGTGTTGCA 550 

AGACACCGCC CTTGTCTTCG GTCAAATGGA CGAACCACCA GGGGTTCGTA 600 

TGCGCGTGGC CTTGTCTGGT CTAACCATGG CTGAATATTT CCGCGACGTT 650 

CAAAACCAGG ACGTTTTGTT GTTCATTGAC AACATCTTCC GTTTTACTCA 700 

GGCAGGTTCC GAGGTTTCCA CGCTGTTGGG CCGTATGCCT TCCGCCGTGG 750 

GTTATCAGCC AACATTGGCT GATGAGATGG GTGTTTTGCA GGAACGGATT 800 

ACCTCTACAC GTGGTAAGTC AATTACTTCC CTG 833 



2) INFORMATION FOR SEQ ID NO: 274 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 417 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium ulcerans 

(B) STRAIN: NCTC 8665 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274 

CCGTGCTCAT CCAGGAGATG ATCACCCGTG TGGCCCGCAA CTTCGGCGGC 50 

ACCTCTGTGT TCGCCGGCGT CGGCGAGCGC ACCCGTGAGG GCAACGACCT 100 

CTGGGTCGAG ATGGACGAGG CCGACGTGCT CAAGGACACC GCCCTGGTGT 150 

TCGGCCAGAT GGACGAGCCG CCGGGAACCC GTCTGCGCGT GGCCCTGTCC 2 00 

GCGCTGACCA TGGCGGAGTA CTTCCGCGAT GTGCAGAACC AGGACGTGCT 2 50 

GCTGTTCATC GACAACATCT TCCGCTTCTC CCAGGCCGGC TCCGAGGTCT 3 00 

CCACCCTGCT GGGCCGCATG CCCTCCGCGG TGGGCTACCA GCCGAACCTG 3 50 

GCGGACGAGA TGGGTGTGCT GCAGGAGCGC ATCACCTCGA CTCGCGGCCA 4 00 

CTCCATCACC TCGATGC 417 



2) INFORMATION FOR SEQ ID NO: 275 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium urealyticum 

(B) STRAIN: ATCC 43 042 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275 



GGGCAGCAGC CAGCACTATT CAACGCGCTG CACGTCGAGG TTGACCTCGA 50 

GGCAGTTGCG AAGACCATTA CCCTGGAGGT CGCACAGCAC CTGGGTGACA 100 

ACCTGGTGCG CACCGTCTCC ATGGCCCCGA CCGACGGCCT GGTCCGCGGT 150 

GCAGAGGTCA AGGACACCGG TAAGC CGATC TCTGTGCCAG TCGGCGATGT 2 00 

15 TGTCAAGGGG CACGTCTTCA ACGCCCTGGG CGACTGCCTG GATGAGCCAG 2 50 

GTCTCGGCCG CGACGGCGAG CAGTGGGGCA TCCACCGCGA GCCACCGGCA 300 

TTCGACGAGC TCGAGGGTAA GACCGAGATC CTGGAGACCG GCGTTAAGGT 350 

CATCGACCTG CTGACCCCTT ACGTCAAGGG CGGCAAGATT GGCCTCTTCG 4 00 

GTGGTGCAGG TGTGGGTAAG ACCGTCCTGA TTCAGGAGAT GATTACCCGT 450 

2 0 ATCGCCCGCG AGTTCTCCGG TACCTCCGTG TTCGCCGGCG TCGGCGAGCG 500 

TACCCGTGAG GGTACGGACC TCTTCCTCGA GATGGAGGAG ATGGGCGTGC 550 

TCCAGGACAC CGCGCTGGTG TTCGGTCAGA TGGATGAGCC GCCGGGAGTC 600 

CGTATGCGCG TGGCTCTGTC CGGTCTGACC ATGGCGGAGT ACTTCCGCGA 650 

TGTTCAGGGC CAGGACGTGC TGCTGTTCAT CGACAACATC TTCCGTTTCA 70 0 

2 5 CCCAGGCAGG TTCTGAGGTC TCCACGCTGC TCGGCCGCAT GCCGTCCGCA 750 

GTGGGTTACC AGCCGACCCT GGCTGACGAG ATGGGTGTTC TGCAGGAGCG 800 

CATTACCTCC ACGAAGGGTA AGTCCATTAC CTCCC 835 



30 



ao 



2) INFORMATION FOR SEQ ID NO: 276 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 818 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coxiella burnetii 

(B) STRAIN: Nine Mile phase II 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276 

CCCGTCACGC TGTCCCGAAA GTGTACGATG CTTTGCAGGT GGATGAAAAT 50 

AATTTAACGC TCGAAGTCCA ACAGCAACTC GGGGACGGTG TCGTGCGCAC 100 

AATTGCCATG GGCAGCACTG AGGGCTTAAA ACGCGATATC GCCGTAAAAA 150 

50 ATACGGAAAA ACCGATTGAA GTTCCCGTAG GAAAAGAAAC TTTAGGTCGT 2 00 

ATCATGAACG TGCTGGGTGA GCCGATCGAT GAGTTAGGTC CCATTAATTC 250 

AAAAGAAAAA CTCCCTATTC ATCGTCCTGC GCCGAGCTTT ATTGAGCAAT 3 00 

CTGGCGCTAC CGAATTATTA GAAACCGGTA TTAAAGTGGT CGATTTGCTT 350 

TGCCCCTTTG CTAAGGGAGG CAAAGTGGGT CTTTTTGGAG GCGCGGGCGT 4 00 

55 TGGAAAAACG GTTAATATGA TGGAATTAAT CCGTAATATC GCCATTGAAC 4 50 

ACAGCGGTTA TTCTGTTTTT GCGGGTGTGG GAGAAAGAAC GCGAGAAGGC 500 

AATGATTTTT ATCATGAAAT GAAAGAATCC AATGTCTTGG ATAAAGTGGC 550 

GTTGGTGTAC GGACAAATGA ACGAGCCGCC AGGGAACCGC TTGCGGGTGG 600 

GTTTGACGGG GCTTACGCTG GCGGAAGCCT TCCGTGACGA AGGACGCGAC 650 

50 GTTCTGTTAT TTATCGATAA TATCTTTCGT TACACTTTGG CAGGGGTTGA 700 
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AGTCTCTGCC CTCCTCGGTC GGATGCCATC GGCTGTGGGT TATCAGCCGA 750 
CGTTGGCCGA AGAGATGGGG GCCCTGCAAG AACGCATTAC TTCCACTAAA 800 
AAAGGGTCCA TTACGTCG 818 



10 



15 



20 



25 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 277 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwardsiella hoshinae 

(B) STRAIN: ATCC 33379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277 



TTCCCGCAGG 
CGGTGCCACG 
TAGTTCGCTG 
GAGGTTGAAG 
TCTGGGCCGT 
AGATCGGTGA 
GAAGATCTGT 
CGACCTGATT 
GGGCCGGTGT 
GCTATCGAGC 
CCGTGAGGGT 
ATAAAGTTTC 
CTGCGCGTGG 
AGGTCGTGAT 
CCGGTACTGA 
TATCAGCCGA 
CTCCACTAAG 



ACGCCGTGCC 
AAACTGGTAC 
CATCGCGATG 
ACCAAGACCA 
ATCATGAACG 
AGAAGAGCGT 
CTAACTCTCA 
TGCCCGTTCG 
GGGTAAGACC 
ACTCCGGTTA 
AACGACTTCT 
TCTGGTGTAT 
CGCTGACCGG 
GTACTGTTGT 
AGTCTCCGCT 
CTCTGGCGGA 
ACCGGGTCCA 



GAAGGTGTAT 
TGGAAGTGCA 
GGCTCTTCCG 
TCCGATCGAG 
TACTGGGTGA 
TGGGCTATCC 
GGAACTGCTG 
CTAAAGGCGG 
GTTAACATGA 
CTCAGTCTTC 
ACCACGAGAT 
GGTCAGATGA 
TCTGACCATG 
TCATCGATAA 
CTGCTGGGCC 
GGAAATGGGG 
TCACCTCTG 



AACGCACTGG 
GCAGCAGCTG 
ACGGTCTGCG 
GTTCCTGTTG 
TCCGGTCGAC 
ATCGTGCTGC 
GAGACCGGCA 
TAAAGTGGGC 
TGGAGCTTAT 
GCCGGTGTGG 
GACCGATTCC 
ACGAGCCACC 
GCGGAGAAAT 
CATCTACCGT 
GTATGCCGTC 
GTGCTGCAAG 



AAGTAAAAGG 50 

GGTGGCGGCG 100 

CCGTGGGCTA 150 

GCAAGGCGAC 2 00 

ATGAAGGGCG 2 50 

ACCGAGCTAT 3 00 

TTAAGGTTAT 3 50 

CTGTTCGGTG 4 00 

CCGTAACATC 450 

GTGAGCGTAC 500 

AACGTATTGG 550 

GGGAAACCGT 600 

TCCGTGATGA 650 

TATACCTTGG 70 0 

GGCGGTAGGT 750 

AGCGTATTAC 800 

829 



2) INFORMATION FOR SEQ ID NO: 2 78 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 09 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Edwards i el la tarda 
55 (B) STRAIN: ATCC 15947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278 

GCCGTGCCGA AGGTGTATAA CGCACTGGAA GTAAAAGGCG GTGCCACGAA 50 
50 ACTGGTACTG GAAGTGCAGC AGCAGCTGGG TGGCGGCGTC GTTCGCTGCA 100 
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TCGCGATGGG CTCCTCCGAC GGTCTGCGCC GTGGGCTGGT 13GTTGAAGKC * 2*5 

CAAGACCATC CGATCGAGGT TCCGGTCGGT AAGGCGACCC TGGGCCGTAT 2 00 

CATGAACGTA CTGGGTGATC CGGTCGACAT GAAGGGCGAG ATCGGCGAAG 2 50 

AAGAGCGTTG GGCTATCCAC CGCGCGGCGC CGAGCTATGA AGATCTGTCC 3 00 

5 AACTCTCAGG AGCTGCTGGA GACCGGCATC AAGGTTATCG ACCTGATTTG 350 

CCCGTTCGCC AAAGGCGGTA AAGTTGGCCT GTTCGGTGGT GCCGGTGTGG 4 00 

GTAAGACCGT TAACATGATG GAGCTTATCC GTAACATCGC TATCGAGCAC 4 50 

TCCGGTTACT CCGTATTTGC CGGTGTAGGC GAGCGTACCC GTGAGGGTAA 500 

CGACTTCTAC CACGAGATGA CCGACTCCAA CGTATTGGAT AAAGTTTCTC 550 

10 TGGTATACGG CCAGATGAAT GAGCCGCCGG GAAACCGTCT GCGTGTGGCG 600 

TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGCGATGT 650 

GTTGTTGTTC ATCGATAACA TTTATCGTTA TACCTTGGCT GGTACCGAAG 700 

TTTCTGCTCT GCTGGGTCGT ATGCCGTCGG CGGTAGGTTA TCAGCCGACC 750 

CTGGCGGAAG AGATGGGTGT GTTGCAAGAG CGTATCACCT CAACGAAGAC 800 

1 5 GGGCTCTAT 8 0 9 



2) INFORMATION FOR SEQ ID NO: 279 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 ( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: Eikenella corrodens 

(B) STRAIN: ATCC 23834 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279 

35 TTTCCGCGTA CCGCCATTCC GCGTGTTTAC GATGCACTCA AACTGGTTGA 50 

TACTGATTTG ACGCTGGAAG TACAGCAGCA GCTTGGTGAC GGCGTTGTCC 10 0 

GTACCATTGC GATGGGTAGT ACAGACGGTT TGAAACGTGG CTTGGCTGTG 150 

CAAAATACTG GTGCACCGAT TACTGTGCCG GTGGGGAAAG CCACCCTAGG 2 00 

CCGCATCATG GACGTTCTGG GCAACCCCGT GGACGAACAA GGTCCGATCG 2 50 

4 0 GTTCTGACCA AACCCGTGCT ATTCATCAAT TTGCACCTAA GTTCGACGAA 300 

CTCTCCAGCA CTACCGAATT GTTGGAAACA GGCATTAAAG TGATCGATTT 3 50 

GCTTTGTCCG TTTGCTAAAG GTGGTAAAGT GGGTCTGTTT GGCGGTGCCG 4 00 

GTGTGGGCAA AACCGTGAAC ATGATGGAGC TGATTAACAA CATTGCCAAA 4 50 

GCGCACAGTG GTCTTTCCGT ATTCGCCGGT GTGGGTGAGC GTACTCGTGA 500 

45 AGGTAACGAC TTCTACCACG AAATGAAAGA CTCCAACGTG TTGGATAAAG 550 

TGGCAATGGT GTATGGCCAG ATGAACGAGC CGCCTGGTAA CCGCTTGCGT 600 

GTTGCTCTAA CTGGTTTGTC GATGGCTGAA TACTTCCGTG ACGAAAAAGA 650 

CGAAAACGGC AAAGGCCGTG ACGTATTATT CTTTGTGGAT AATATCTACC 700 

GCTATACTCT GGCGGGTACC GAAGTGTCGG CTCTGCTTGG CCGTATGCCT 750 

50 TCTGCTGTGG GTTATCAGCC AACTTTGGCT GAAGAAATGG GTCGTTTGCA 8 00 

GGAGCGTATT ACCTCCACCC AAACTGGTTC CATTACCTCT 84 0 



55 2) INFORMATION FOR SEQ ID NO: 280 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS : Double 
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10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : En terobacter agglomerans 

(B) STRAIN: ATCC 27989 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280 



GCCGTACCAC GAGTGTACGA TGCACTTGAG GTAAAGAATG GTGAAGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AACAGCTCGG CGGTGGCGTT GTACGTACCA 100 

TCGCAATGGG TTCTTCTGAT GGTCTGCGTC GTGGTCTGGA AGTAACGGAC 150 

CTGGCTCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

L5 CATGAACGTA CTGGGCGAAC CAGTAGACAT GAAAGGCGAC ATCGGTGAAG 2 50 

AAGAGCGTTG GGCGATCCAC CGTGCAGCAC CGTCCTACGA AGAGTTGTCA 300 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCT AAGGGCGGTA AAGTGGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 4 50 

2 0 TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAC AAAGTATCCC 550 

TGGTGTATGG TCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACCATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

ACTGTTGTTC GTTGACAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 700 

25 TATCCGCACT GCTGGGCCGT ATGCCTTCTG CGGTAGGTTA TCAGCCGACG 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGG 803 



30 



2) INFORMATION FOR SEQ ID NO: 281 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 833 bases 
IS (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter amnigenus 

(B) STRAIN: ATCC 33072 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281 

CGAATTCCCT CAGGATGCCG TACCGCGCGT GTACGATGCT CTTGAGGTAC 50 

AGAATGGTAA CGAGAGTCTG GTGCTGGAAG TTCAGCAGCA GCTCGGTGGT 100 

GGTATCGTAC GTACTATCGC CATGGGTTCT TCCGACGGTC TGCGTCGTGG 150 

50 TCTGGCTGTT AAAGATCTCG AACACCCGAT CGAAGTCCCG GTAGGTAAAG 200 

CAACACTGGG TCGTATCATG AACGTTTTGG GTCAACCAAT CGACATGAAA 250 
GGCGACATCG GTGAAGAAGA CCGTTGGGCA ATCCACCGTG CAGCACCTTC ' 3 00 

CTATGAAGAG CTGTCTAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 3 50 

TTATCGACCT GATGTGTCCG TTCGCTAAGG GCGGTAAAGT TGGTCTGTTC 400 

55 GGCGGTGCGG GCGTGGGTAA AACTGTAAAC ATGATGGAGC TGATCCGTAA 4 50 

CATCGCGATC GAGCACTCCG GTTACTCCGT GTTTGCAGGC GTGGGTGAGC 5 00 

GTACTCGTGA GGGTAACGAC TTCTACCACG AAATGACCGA TTCCAACGTT 550 

CTGGATAAAG TATCCCTGGT TTATGGCCAG ATGAACGAGC CACCAGGAAA 600 

CCGTCTGCGC GTTGCGCTGA CCGGTCTGAC TATGGCTGAG AAGTTCCGTG 650 

50 ACGAAGGTCG TGACGTACTG CTGTTCGTAG ATAACATCTA CCGTTACACC 700 
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CTGGCCGGTA CTGAAGTATC TGCGCTGCTG GGCCGTATGC CTTCAGCGGT 
AGGTTACCAG CCGACCCTGG CGGAAGAGAT GGGCGTTCTG CAGGAACGTA 
TCACTTCTAC CAAAACCGGT TCTATCACCT CCG 



750 
800 
833 



2) INFORMATION FOR SEQ ID NO: 282 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter asburiae 

(B) STRAIN: ATCC 35953 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282 

GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 5 0 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 10 0 

TCGCGATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTCAAAGAC 15 0 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 200 

CATGAACGTA TTGGGTCAAC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 250 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTTG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CCGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 50 0 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACGATGGC TGAGAAGTTC CGTGATGAAG GCCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 70 0 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCTACG 750 

CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 80 0 

CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 283 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter cancerogenus 

(B) STRAIN: ATCC 35317 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTACAGAATG GTAACGAGAG 50 
CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTACGTACTA 100 



169 
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TCGCCATGGG 
CTTGAGCACC 
CATGAACGTA 
AAGAGCGTTG 
AGCTCTCAGG 
TCCGTTCGCG 
GTAAAACCGT 
TCCGGTTATT 
CGACTTCTAC 
TGGTGTACGG 
CTGACCGGCC 
ACTGCTGTTC 
TATCTGCACT 
CTGGCGGAAG 
CGGTTCTATC 



TTCTTCCGAC 
CGATCGAAGT 
TTGGGTCAAC 
GGCTATCCAC 
AACTGCTGGA 
AAGGGCGGTA 
AAACATGATG 
CCGTGTTTGC 
CACGAAATGA 
CCAGATGAAC 
TGACCATGGC 
GTTGATAACA 
GCTGGGTCGT 
AGATGGGTGT 
A 



GGTCTGCGTC 
CCCGGTAGGT 
CAATCGACAT 
CGCGCAGCAC 
AACCGGCATC 
AAGTCGGTCT 
GAGCTGATCC 
GGGCGTGGGT 
CCGACTCCAA 
GAGCCACCAG 
TGAGAAGTTC 
TCTACCGTTA 
ATGCCTTCAG 
TCTTCAGGAA 



GTGGTCTGGA 
AAAGCAACAC 
GAAAGGCGAC 
CTTCCTACGA 
AAAGTTATCG 
GTTCGGTGGT 
GTAACATCGC 
GAACGTACTC 
CGTTCTGGAT 
GAAACCGTCT 
CGTGACGAAG 
CACCCTGGCC 
CGGTAGGTTA 
CGTATCACCT 



PCT/CA00/01150 

AGTAAAAGAC 150 

TGGGTCGTAT 2 00 

ATCGGTGAAG 2 50 

AGAGCTGTCC 3 00 

ACCTGATGTG 3 50 

GCGGGTGTAG 4 00 

GATCGAGCAC 4 50 

GTGAGGGTAA 500 

AAAGTATCCC 550 

GCGCGTTGCG 60 0 

GTCGTGACGT 650 

GGTACCGAAG 700 

TCAGCCTACG 750 

CAACCAAAAC 8 00 

811 



20 



2) INFORMATION FOR SEQ ID NO: 284 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

2 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Enterobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 

3 5 GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTTAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACAC TGGGTCGTAT 2 00 

CATGAACGTA TTGGGTCAGC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 2 50 

10 AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 30 0 

AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTTGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CCGTATTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

15 CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACTGGTC TGACGATGGC TGAGAAGTTC CGTGACGAAG GCCGTGACGT 650 

TCTGCTGTTC GTTGATAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 70 0 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCTACG 750 

50 CTGGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CTACCAAAAC 800 

CGGTTCTATC ACTTCCG 817 



2) INFORMATION FOR SEQ ID NO: 28 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter gergoviae 

(B) STRAIN: ATCC 33028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285 



AATGAGAGCC TGGTGCTGGA AGTTCAGCAG CAGCTCGGCG GCGGTATCGT 50 

GCGTACCATC GCAATGGGTT CTTCTGACGG TCTGCGTCGC GGTCTGGAAG 100 

TTAAAGATCT CGAACATCCG ATCGAAGTCC CGGTAGGTAA AGCGACCCTC 150 

GGCCGTATTA TGAACGTGCT GGGTCAGCCG GTTGATATGA AAGGCGATAT 2 00 

15 CGGCGAAGAA GAGCGTTGGG CGATCCACCG CGCTGCGCCG TCCTATGAAG 250 

AGCTCTCCAG CTCTCAGGAA CTGCTGGAAA CCGGTATCAA GGTAATGGAC 3 00 

CTGATTTGCC CGTTCGCGAA GGGCGGTAAA GTCGGTCTGT TCGGCGGTGC 3 50 

GGGCGTTGGT AAAACCGTAA ACATGATGGA GCTGATCCGT AACATCGCGA 4 00 

TCGAGCACTC CGGCTACTCC GTGTTTGCGG GCGTGGGTGA ACGTACTCGT 450 

2 0 GAGGGTAACG ACTTCTACCA CGAAATGACC GACTCCAACG TTATCGACAA 50 0 

AGTATCCCTG GTGTACGGCC AGATGAACGA GCCGCCGGGA AACCGTCTGC 550 

GCGTGGCGCT GACCGGTCTG ACCATGGCTG AGAAATTCCG TGACGAAGGT 600 

CGTGACGTTC TGCTGTTCGT CGATAACATC TACCGCTATA CCCTCGCCGG 650 

TACTGAAGTA TCCGCACTGC TGGGCCGTAT GCCTTCTGCA GTAGGTTACC 700 

25 AGCCGACGCT GGCGGAAGAG ATGGGTGTTC TGCAGGAACG TATCACCTCC 750 

ACCAAAACCG GTTCTA 766 



30 2) INFORMATION FOR SEQ ID NO: 286 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

*0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter hormaechei 

(B) STRAIN: ATCC 49162 



*5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286 



GCCGTACCAC GCGTGTACGA CGCGCTTGAG GTACAGAATG GTAACGAGAG 50 

CCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCCATGGG TTCTTCCGAC GGTCTGCGTC GTGGTCTGGA AGTGAAAGAC 150 

CTTGAGCACC CGATCGAAGT CCCGGTAGGT AAAGCAACGC TGGGTCGTAT 2 00 

50 CATGAACGTA TTGGGTCAGC CAATCGACAT GAAAGGCGAC ATCGGTGAAG 2 50 

AAGAGCGTTG GGCTATCCAC CGCGCGGCAC CTTCCTACGA AGAGCTGTCC 3 00 

AGCTCTCAGG AACTGCTGGA AACCGGCATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTTGCG AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 4 50 

55 TCCGGTTACT CCGTGTTTGC GGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTCTGGAC AAAGTATCCC 550 

TGGTTTACGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACTGGCC TGACGATGGC TGAGAAGTTC CGTGACGAAG GCCGTGACGT 650 

TCTGCTGTTC GTCGATAACA TCTACCGTTA CACCCTGGCC GGTACGGAAG 700 

50 TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCAACG 750 

171 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

CTTGCGGAAG AGATGGGTGT TCTTCAGGAA CGTATCACCT CGACCAAAAC 800 
CGGTT 805 



2) INFORMATION FOR SEQ ID NO: 287 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 7 91 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter sakasakii 

(B) STRAIN: ATCC 29544 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287 

TACGACGCCC TTGAGGTAAC GAATGGTAAT GAGCGTCTGG TGCTGGAAGT 50 

CCAGCAGCAG CTCGGCGGCG GTATCGTACG TACCATCGCG ATGGGCTCTT 100 

CCGACGGTCT GCGTCGCGGT CTGCCTGTTG CAGACCTTGA GCACCCGATC 150 

25 GAAGTGCCGG TAGGTAAAGC GACGCTGGGT CGTATCATGA ACGTCCTGGG 200 

TCAGCCTATC GACATGAAAG GCGACATCGG CGAAGAAGAG CGTTGGGCGA 2 50 

TTCATCGCGC GGCGCCGTCC TATGAAGAGC TGTCCAGCTC TCAGGAACTG 3 00 

CTGGAAACCG GCATCAAAGT TATCGACCTG ATGTGTCCGT TCGCGAAGGG 3 50 

CGGTAAAGTC GGTCTGTTCG GTGGTGCAGG TGTAGGTAAA ACCGTAAACA 4 00 

3 0 TGATGGAGCT TATTCGTAAC ATCGCGATTG AGCACTCCGG TTACTCCGTG 450 

TTTGCGGGCG TGGGCGAACG TACCCGTGAA GGTAACGACT TCTACCACGA 500 

AATGACCGAC TCCAACGTAC TGGATAAAGT ATCCCTGGTG TACGGCCAGA 550 

TGAACGAGCC GCCGGGAAAC CGTCTGCGCG TTGCGCTGAC CGGCCTGACC 600 

ATGGCTGAGA AATTCCGTGA CGAAGGTCGT GACGTTCTGC TGTTCGTCGA 650 

3 5 CAACATCTAC CGTTACACCC TGGCCGGTAC TGAAGTATCC GCACTGCTGG 700 

GCCGTATGCC TTCAGCGGTA GGTTATCAGC CGACCCTGGC GGAAGAGATG 750 

GGTGTTCTGC AGGAGCGTAT CACCTCCACC AAAACCGGTT C 791 



40 



2) INFORMATION FOR SEQ ID NO: 288 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 839 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288 

TTTCTTTAGA TCAATCCTTA CCAGACATCA ACAATGCGTT GATTGTTTAC 50 

AAAAAAGATA AAACAAAAGT TGTTCTTGAA GTTGCTTTGG AACTTGGTGA 100 

TGGTGTTATC CGCACAATCG CTATGGAGGC TACTGATGGA TTGCAACGTG 150 

50 GAATGGAAGT TGTCGATACT GGCAAATCAA TCTCCGTTCC TGTAGGTAAA 2 00 

172 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

GATACTCTAG GTCGTGTGTT TAACGTATTA GGTGAAACGA TTGATAAAGA 2 50 

AGCACCTTTT CCAGAAGATG CAGAAAGAAG CGGCATTCAT AAAAAGGCGC 3 00 

CTGCTTTTGA AGACCTTAGT ACAAGTAACG AGATTTTGGA AACAGGGATC 3 50 

AAGGTTATCG ACTTATTAGC CCCTTATTTA AAAGGTGGGA AAGTCGGACT 40 0 

5 ATTCGGTGGT GCCGGTGTTG GTAAAACCGT TTTGATCCAA GAATTAATTC 4 50 

ATAATATCGC CCAAGAACAC GGTGGTATTT CAGTGTTTAC CGGTGTTGGG 500 

GAACGTACTC GTGAAGGGAA CGACCTTTAT TATGAAATGA AAGACTCTGG 550 

CGTTATTGAG AAAACAGCCA TGGTGTTCGG ACAAATGAAC GAGCCGCCTG 600 

GTGCACGTAT GCGTGTTGCC TTGACTGGTT TGACATTAGC TGAATATTTC 650 

10 CGTGATGAAG AAGGACAAGA TGTGTTGCTA TTTATTGACA ACATCTTCCG 700 

CTTTACTCAA GCCGGATCAG AAGTTTCTGC CTTATTAGGA CGTATGCCAT 750 

CAGCCGTTGG GTATCAACCA ACTTTGGCAA CTGAAATGGG GCAATTACAA 8 00 

GAACGAATCA CTTCAACCAA AAAAGGTTCG ATCACTTCA 83 9 



15 



30 



2) INFORMATION FOR SEQ ID NO: 289 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 84 7 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 

(B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289 



TTCTCTCTAG ACCAATCATT ACCAGATATC AACAATGCGT TGATTGTTTA 50 

CAAAAAAGAT GAGCAGAAAA CAAAAGTTGT GTTAGAAGCT GCCTTAGAAC 100 

3 5 TTGGCGACGG CGTTATCCGT ACGATTGCCA TGGAATCAAC AGATGGCTTA 150 

CAACGAGGAA TGGAAGTAAT CGATACAGGC GCCTCCATTT CTGTTCCAGT 2 00 

TGGGACAGAA ACCTTAGGAC GGGTGTTTAA TGTCTTAGGG GACACCATCG 250 

ATTTAGAAGC GCCGTTCCCT GAAGAAGCAC CCCGCAGTGG GATTCACAAA 300 

AAAGCACCTG ACTTTGATGA ATTGTCAACA AGTACGGAGA TCCTTGAAAC 3 50 

4 0 TGGGATCAAA GTTATCGATT TGTTAGCCCC TTATTTAAAA GGGGGGAAAG 4 00 

TTGGACTTTT CGGTGGTGCC GGTGTTGGTA AAACCGTCTT GATCCAAGAA 450 

TTGATCCACA ACATCGCCCA AGAGCATGGT GGGATCTCTG TCTTCACAGG 500 

TGTTGGTGAA CGGACACGTG AAGGAAATGA CCTTTATAAT GAAATGAAAG 550 

AATCTGGCGT TATCGAAAAA ACAGCCATGG TGTTTGGACA AATGAACGAA 600 

45 CCACCAGGTG CTCGGATGCG GGTAGCCTTG ACTGGTTTGA CATTAGCCGA 650 

GTACTTCCGT GATGTGGAAG GACAAGACGT GCTCTTGTTT ATCGATAATA 700 

TCTTCCGCTT CACTCAAGCA GGTTCTGAAG TATCTGCCTT ACTAGGTCGG 750 

ATGCCGTCTG CCGTTGGGTA TCAGCCAACA TTAGCAACTG AGATGGGGCA 8 00 

ATTACAAGAA CGGATCACAT CGACGAAGAA AGGTTCCGTT ACGTCTA 84 7 

50 



2) INFORMATION FOR SEQ ID NO: 290 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

173 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 
5 (B) STRAIN: ATCC 19432 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290 

TTTCTTTAGA CCAATCCTTA CCAGATATCA ACAACGCTTT AGTTGTTTAT 50 

10 AAAAATGATG AGAAGAAATC GAAAGTTGTT CTTGAAACAG CGCTAGAATT 100 

AGGTGACGGT GTCATCCGTA CAATCGCGAT GGAATCAACA GATGGTTTAC 150 

AACGCGGAAT GGAAGTCATT GATACAGAAA AAGCAATTTC TGTACCAGTG 2 00 

GGTAAAGAAA CGTTAGGTCG TGTATTCAAT GTATTAGGAG ATACGATCGA 2 50 

TTTATCTGCA CCTTTCCCAG AAGATGCAAA ACGTAGCGAA ATCCATAAAA 3 00 

15 AAGCACCAAA CTTTGATGAG TTAAGTACAA GTACTGAGAT CCTTGAAACT 350 

GGGATCAAAG TTATTGACTT GCTTGCTCCT TACTTAAAAG GTGGGAAAGT 400 

TGGATTATTC GGTGGTGCCG GTGTAGGTAA AACTGTATTG ATCCAAGAAT 4 50 

TGATCCATAA TATCGCTCAA GAACACGGTG GTATTTCTGT ATTTACTGGT 500 

GTTGGTGAAC GTACACGTGA AGGTAATGAC CTTTATTATG AAATGAAAGA 550 

2 0 TTCAGGAGTT ATTGAAAAAA CAGCCATGGT GTTTGGTCAA ATGAACGAAC 600 

CACCAGGTGC ACGTATGCGT GTTGCCTTGA CTGGTTTGAC GATTGCTGAA 650 

TACTTCCGTG ATGTTGAAGG GCAAGACGTG CTATTGTTTA TTGATAATAT 700 

TTTCCGTTTC ACTCAAGCCG GTTCAGAAGT TTCTGCCCTA TTAGGTCGTA 750 

TGCCTTCTGC CGTTGGGTAC CAACCAACGC TAGCAACAGA AATGGGTCAA 800 

25 TTACAAGAAC GGATCACTTC AACGAAAAAA GGTTCAATCA CTTCA 845 



2) INFORMATION FOR SEQ ID NO : 2 91 



30 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

4 0 (A) ORGANISM: Enterococcus faecal is 

(B) STRAIN: ATCC 2 9212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291 

£5 TTAGATCAAT CCTTACCCGA TATTAACAAC GCTTTAGTCG TTTATAAAAA 50 

TGGCGAAGCA AAACAAAAAG TAGTACTTGA AGTCGCTTTA GAACTAGGTG 100 

ATGGAGTGAT TCGTTCTATC GCCATGGAAT CGACAGATGG TTTACAACGT 150 

GGAATGGAAG TTATCGATAC AGGAAAATCA ATTTCAGTTC CTGTTGGTAA 2 00 

AGATACATTA GGTCGTGTGT TTAACGTTTT AGGAGACACA ATTGACTTAG 250 

50 AAGCGCCATT CCCTGCAGAT GCTGAACGTA GTGGGATTCA TAAAAAAGCG 3 00 

CCAGCATTTG ATGAATTAAG TACCAGTAAT GAAATTTTAG AAACAGGGAT 3 50 

TAAAGTTATT GACTTATTAG CACCTTATCT AAAAGGTGGT AAAGTCGGAC 4 00 

TTTTCGGTGG TGCCGGTGTT GGTAAAACCG TCTTAATTCA AGAATTAATT 4 50 

CATAATATTG CCCAAGAACA TGGAGGGATT TCCGTCTTTA CTGGTGTTGG 5 00 

55 TGAACGGACA CGTGAAGGGA ACGATCTGTA CTATGAAATG AAAGATTCAG 550 

GCGTTATTGA AAAAACAGCC ATGGTTTTTG GTCAAATGAA CGAACCGCCA 600 

GGTGCACGGA TGCGTGTGGC CTTAACTGGG TTAACGATTG CTGAATATTT 650 

CCGTGATGTG GAAGGACAAG ACGTGCTATT ATTTATTGAT AACATTTTCC 700 

GTTTCACCCA AGCCGGTTCA GAAGTTTCTG CCCTTTTAGG TCGGATGCCG 750 

50 TCAGCCGTTG GTTACCAACC AACCTTAGCG ACTGAAATGG GACAATTACA 800 

174 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AGAACGGATT ACTTCAACGA AAAAAGGATC AATTACCTCT 



PCT/CA00/01150 

840 



5 2) INFORMATION FOR SEQ ID NO: 2 92 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292 



TTTCTTTAGA CCAATCCTTA CCAGATATCA ACAACGCATT AGTTGTTTAT 5 0 

AAAAATGACG AAAATAAATC AAAAGTTGTT CTTGAAGCAG CGTTAGAATT 100 

AGGGGACGGA GTGATCCGGA CCATTGCGAT GGAATCAACA GATGGTTTAC 150 

AAAGAGGAAT GGAAGTCATT GATACAGGCA AAGCAATCTC TGTTCCTGTA 2 00 

2 5 GGTAAAGAAA CATTAGGTCG CGTATTCAAC GTACTAGGAG ATACGATCGA 2 50 

TTTAGAAACA CCTTTCCCAG AAGATGCGGA AAGAAGCGAA ATTCATAAAA 300 

AAGCACCAGC CTTTGACGAA TTAAGTACAA GTACAGAAAT TTTGGAAACA 3 50 

GGGATCAAAG TTATCGATTT GCTTGCCCCA TATTTAAAAG GTGGGAAAGT 4 00 

CGGACTATTC GGTGGTGCCG GTGTTGGTAA AACCGTACTG ATCCAAGAAC 450 

3 0 TGATCCATAA TATCGCCCAA GAACATGGTG GTATTTCTGT ATTTACCGGT 500 

GTAGGTGAAC GTACTCGTGA AGGTAATGAC TTGTATTATG AAATGAAAGA 550 

TTCAGGAGTT ATCGAAAAAA CAGCCATGGT GTTCGGACAA ATGAACGAAC 600 

CACCAGGTGC ACGTATGCGT GTTGCTTTGA CTGGTTTGAC GATTGCGGAA 650 

TATTTCCGTG ATGTAGAAGG TCAAGATGTA CTGTTGTTTA TCGACAACAT 70 0 

3 5 TTTCCGTTTC ACTCAAGCTG GATCTGAAGT ATCAGCCTTG TTAGGACGGA 750 

TGCCTTCTGC GGTTGGTTAT CAACCAACAT TGGCAACAGA AATGGGTCAA 8 00 

TTGCAAGAAC GTATCACATC TACGAAAAAA G 831 



40 



50 



2) INFORMATION FOR SEQ ID NO: 293 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 6 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 4 9573 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293 

CCTTACCAGA TATCAACAAT GCGTTGATCG TTTACAAAAA AGACGAGAAA 50 

AAAACAAAAG TAGTATTGGA AGCCGCTTTA GAACTAGGGG ATGGTGTGAT 100 

CCGCACCATC GCAATGGAAT CTACAGACGG TTTGCAACGA GGAATGGAAG 150 

60 TCATCGATAC CGGTGCCTCA ATCTCTGTCC CTGTAGGAAC AGATACTCTA 2 00 

175 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

GGCCGAGTAT TTAATGTACT AGGCGATACT ATCGACTTGG AAGCACCAT^ ' 2 5b : 

CCCAGAAGAT GCCAAACGTA GTGGCATCCA CAAAAAAGCC CCAGATTTCG 300 

ATGAATTGTC AACAAGTACA GAAATCCTTG AAACTGGGAT CAAAGTTATC 3 50 

GATTTATTAG CTCCTTACTT AAAAGGTGGT AAAGTCGGCT TGTTCGGTGG 400 

5 TGCCGGTGTT GGTAAAACCG TATTGATTCA AGAATTGATT CACAATATCG 4 50 

CTCAAGAGCA TGGGGGAATT TCAGTATTTA CCGGTGTTGG CGAACGGACG 500 

CGTGAAGGTA ATGACTTGTA TTATGAAATG AAAGAATCAG GCGTTATCGA 550 

AAAGACAGCC ATGGTTTTCG GTCAAATGAA TGAACCACCA GGTGCCCGGA 6 00 

TGCGGGTTGC TTTGACTGGT TTGACCATTG CTGAGTATTT CCGTGACGTT 650 

10 GAAGGACAAG ATGTGCTCTT GTTTATCGAT AATATTTTCC GTTTCACACA 70 0 

AGCGGGTTCT GAAGTATCTG CCTTGTTAGG CCGGATGCCA TCAGCCGTTG 750 

GTTATCAACC AACTCTAGCA ACTGAAATGG GTCAATTACA AGAACGAATC 800 

ACTTCTACGA AAAAAGGATC TGTAAC 826 



15 



30 



2) INFORMATION FOR SEQ ID NO: 2 94 



(i) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus saccharolyticus 

(B) STRAIN: ATCC 43076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294 



TTTCTTTGGA CCAATCCTTA CCAGACATCA ACAATGCGTT GGTGGTCTAT 50 

AAAAAGAATG ATGAAAAAAC AAAAGTGGTA CTTGAAACAG CTTTAGAACT 10 0 

3 5 TGGTGATGGT GTCGTACGTA CGATTGCTAT GTCGTCAACA GACGGTTTGC 150 

AACGTGGGAT GGAAGTCATC GATACAGGAG CATCAATTTC TGTTCCTGTT 200 

GGGAAAGAGA CATTAGGACG TGTATTTAAC GTTTTAGGGG AGACTATCGA 2 50 

CTTAGATGGT CCATTCCCAG AAGAAGTAGC ACGAGATGGT ATTCATAAAA 300 

AGGCACCTGA TTTTGATGAA TTAAGTACAA GTACGGAGAT TCTTGAAACA 3 50 

10 GGGATTAAAG TAATCGATTT ATTAGCGCCT TACTTAAAAG GTGGGAAAGT 40 0 

TGGTTTATTC GGTGGTGCCG GTGTAGGTAA AACGGTATTA ATTCAAGAAT 450 

TGATTAACAA TATTGCGCAA GAACATGGTG GTATTTCAGT ATTTGCGGGT 500 

GTTGGTGAGC GTACTCGTGA AGGAAATGAC CTTTATTATG AAATGAAAGA 550 

GTCGGGCGTT ATTGAGAAAA CAGCGATGGT TTTTGGACAA ATGAACGAAC 600 

15 CACCAGGTGC ACGTATGCGA GTTGCTTTAA CTGGTTTAAC CATTGCAGAA 650 

TACTTCCGTG ATGTTGAAGG ACAAGATGTA TTACTATTTA TTGATAACAT 700 

TTTCCGTTTT ACTCAAGCTG GTTCAGAAGT TTCAGCTTTA TTAGGACGTA 750 

TGCCTTCAGC GGTAGGGTAT CAACCGACAT TAGCAACAGA AATGGGACAA 800 

TTACAAGAAC GTATTACGTC AACGAAAAAA GGCTCAATTA CATCAA 84 6 

50 

2) INFORMATION FOR SEQ ID NO: 2 95 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

176 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia fergusonii 
5 (B) STRAIN: ATCC 3 5469 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295 

GCCGTACCGC GCGTGTACGA TGCTCTTGAG GTGCAAAATG GTAATGAGCG 50 

10 TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGTGGTATC GTGCGTACCA 100 

TCGCAATGGG GTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGCCGTAT 200 

CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGCTGTCA 3 00 

15 AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACTGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CTGACTCCAA CGTTATCGAC AAAGTATCCC 550 

2 0 TGGTATATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

TCTGCTGTTC GTTGACAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCAG CGGTAGGTTA TCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTTCAGGAA CGTATCACCT CCACCAAAAC 800 

25 TGG 803 



2) INFORMATION FOR SEQ ID NO: 2 96 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
£0 (A) ORGANISM: Escherichia hermannii 

(B) STRAIN: ATCC 33650 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296 

4 5 GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA ATGGTGATGA 50 

GCGTCTGGTG CTGGAAGTGC AGCAGCAGCT CGGCGGCGGT ATCGTGCGTA 100 

CCATCGCAAT GGGTTCTTCC GACGGTCTGC GTCGTGGTCT GACTGTCGTC 150 

GACCTCGAGC ACCCGATCGA AGTCCCGGTA GGTAAAGCGA CCCTGGGCCG 2 00 

TATCATGAAC GTGCTGGGTC AGCCGATCGA CATGAAAGGC GATATCGGTG 2 50 

50 AAGAAGAGCG TTGGGCGATT CACCGCGCGG CGCCGTCCTA TGAAGAGCTG 3 00 

TCCAGCTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 3 50 

GTGTCCGTTC GCGAAGGGCG GTAAAGTCGG TCTGTTCGGT GGTGCGGGCG 400 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

55 TAACGACTTC TACCATGAAA TGACCGACTC CAACGTTCTG GACAAAGTAT 550 

CCCTGGTTTA CGGCCAGATG AACGAACCGC CGGGAAACCG TCTGCGCGTT 600 

GCACTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGTTG TTCGTCGACA ACATCTACCG TTACACCCTG GCCGGTACTG 7 00 

AAGTATCCGC ACTGCTGGGC CGTATGCCTT CTGCGGTAGG TTACCAGCCG 750 

SO ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAGCGTATCA CCTCCACCAA 800 

177 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
AACCGGTTCT ATCACCTCCG TA 



PCT/CA00/01150 



5 2) INFORMATION FOR SEQ ID NO: 297 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia vulneris 

(B) STRAIN: ATCC 33 821 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297 



20 



CCGAACGTGT ACGACGCCCT CGAAGTGACA AATGGTAATG AGCGTCTGGT 50 

GCTGGAAGTT CAGCAGCAGC TCGGCGGCGG TATCGTACGT ACCATCGCTA 100 

TGGGTTCTTC CGACGGTCTG CGTCGTGGTC TGGAAGTTCA GGACCTCGAG 150 

CACCCGATCG AAGTGCCGGT AGGTAAAGCG ACCCTGGGTC GTATCATGAA 2 00 

25 CGTACTGGGT CAGCCGATCG ATATGAAAGG CGACATCGGT GAAGAAGAGC 2 50 

GTTGGGCTAT TCACCGTGCA GCACCGTCCT ATGAAGAGCT CTCCAGCTCT 3 00 

CAGGAACTGC TGGAAACCGG CATCAAGGTT ATCGACCTGA TGTGTCCGTT 3 50 

CGCCAAGGGC GGTAAAGTCG GCCTGTTCGG CGGCGCGGGC GTGGGTAAAA 4 00 

CCGTAAACAT GATGGAGCTG ATCCGTAACA TCGCGATCGA GCACTCCGGT 4 50 

30 TACTCCGTGT TTGCAGGCGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 500 

CTACCACGAG ATGACCGACT CCAACGTTCT GGACAAAGTA TCCCTGGTGT 550 

ACGGCCAGAT GAACGAGCCG CCGGGAAACC GTCTGCGCGT GGCACTGACC 60 0 

GGCCTGACCA TGGCTGAGAA GTTCCGTGAC GAAGGTCGTG ACGTTCTGCT 65 0 

GTTCGTTGAC AACATCTATC GTTACACCCT GGCCGGTACG GAAGTATCTG 70 0 

35 CACTGCTGGG CCGTATGCCT TCAGCGGTAG GTTACCAGCC GACGCTGGCG 750 

GAAGAGATGG GCGTTCTGCA GGAGCGTATC ACCTCCACCA AAACCGGTTC 8 00 

TATCACCT 8 08 



40 



50 



2) INFORMATION FOR SEQ ID NO: 298 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 843 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eubacterium 1 en turn 

(B) STRAIN: ATCC 43055 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298 

TTTCCCCCTG ATCAGCTGCC GGCGATTTAC AACGCGCTGA CGGTTGATGC 50 

CAAGACCCTG GCGGGCGACT TGCACCTCGT GCTCGAGGTC GAGACGCACC 100 

TGCCGGGCAA CCTTGTCCGC TCGGTGGCCA TGAGCTCGAC GGACGGTCTC 150 

SO GTCCGCGGCC TCGAGGTCGT CGACACGGGC AACCCGATCA TGATGCCCGT 2O0 

178 
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GGGTCCCGAG ACCCTGGGTC GCATCTGGAA 
ACGAGAAGCC GATGCCCGAG GTGAAGGGCT 
GCTCCGGACT ACGACGAGCT GTCCACCACC 
CATCAAGGCC ATCGACCTCG TCGAGCCCTT 
5 GTCTGTTCGG CGGCGCCGGC GTGGGCAAGA 
ATCAACAACC TGGCCCAGGA GCACGGCGGC 
GGGCGAGCGT ACCCGCGAGG GTACCGACCT 
CGGGCGTCAT CAACAAGACC TGC CTCGTGT 
CCGGGAGCGC GTCTGCGCGT GGGTCTCGCG 
10 CTTCCGCGAT CAGGGCCAGG ACGTGCTTCT 
GCTTCACGCA GGCCGGCTCC GAGGTGTCCG 
TCTGCCGTGG GTTACCAGCC GACGCTGGCA 
GGAGCGCATC ACGTCGACGT CCACCGGCTC 

15 



PCT/CA00/01150 

CGTCATGGGC GAGCCCGTCG 2 50 

ACATGCCCAT CCACCGTCCG 300 

ACCGAGATCT TCGAGACCGG 350 

CGTCAAGGGC GGCAAGACGG 400 

CGGTTATCAT CCAGGAGCTC 4 50 

ACGTCGGTGT TCACGGGCGT 500 

CTACCTGGAG ATGAGCGACT 550 

ACGGTCAGAT GAACGAGCCT 600 

GGCCTCACCG AGGCGGAGTA 650 

GTTCGTGGAC AACATCTTCC 700 

CTCTGCTGGG CCGCATGCCC 750 

ACCGAGATGG GCGACCTGCA 8 00 

CATCACGTCC GTG 843 



2) INFORMATION FOR SEQ ID NO: 299 

(i) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



2 5 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ewingella. americana 

(B) STRAIN: ATCC 33852 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299 



TCCCTCAGGA TGCAGTACCG AACGTGTACA ATGCTCTTGA GGTAGAAAAC 50 

GGTGCCTCCA AACTGGTTCT GGAAGTTCAG CAACAGTTAG GCGGCGGCGT 100 

3 5 TGTTCGTTGT ATCGCAATGG GTACCTCAGA CGGCCTTCGT CGCGGTCTGA 150 

AAGTGAACAA CCTGGAACAC CCAATTGAAG TTCCGGTTGG TAAAGCGACT 2 00 

CTGGGTCGTA TCATGAACGT ATTGGGTGAA CCAATCGACA TGAAAGGTGA 2 50 

AATCGGCGAA GAAGAACGTC GTGCAATTCA CCGTCCAGCG CCTTCTTATG 3 00 

AAGAGCTGGC TAACTCCCAA GAATTGCTGG AAACCGGTAT CAAAGTTATG 350 

40 GACCTGATGT GTCCGTTCGC TAAGGGCGGT AAAGTCGGTC TGTTCGGTGG 4 00 

TGCGGGTGTT GGTAAAACTG TAAACATGAT GGAGCTGATC CGTAACATCG 4 50 

CGATCGAGCA CTCCGGTTAC TCAGTGTTTG CAGGCGTGGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG ACTGACTCCA ACGTTATCGA 550 

CAAAGTTTCC CTGGTCTATG GTCAGATGAA TGAGCCACCA GGTAACCGTC 600 

45 TGCGCGTTGC ACTGACCGGC CTGACCATGG CGGAGAAATT CCGTGATGAA 650 

GGTCGTGACG TACTGCTGTT CGTTGACAAC ATTTACCGTT ACACCCTGGC 700 

AGGTACCGAA GTGTCCGCAC TTCTGGGCCG TATGCCATCG GCGGTAGGTT 750 

ATCAGCCAAC GCTGGCGGAA GAGATGGGTG CTCTGCAAGA GCGTATCACC 8 00 

TCTACCAAAA GTGGTTCTAT CACCTCCGT 82 9 

50 



2) INFORMATION FOR SEQ ID NO: 300 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

179 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Francisella tularensis 

(B) STRAIN: LVS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300 



AACACGCCTA 

L0 ATTAGAAGTT 
TGGGATCTAG 
GCGCCTATTT 
TGTTTTAGGT 
AAAGATCTAT 

L5 ACAGAAATAT 
TGCTAAGGGC 
CTGTAACGAT 
TACTCTGTAT 
CTACTATGAG 

2 0 ATGGTCAGAT 
GGCTTAACAA 
GTTTATCGAT 
CGCTACTTGG 
GCTGAGATGG 

25 TATTA 



AAGTATATGA 
CAGCAACAAA 
TGATGGTCTT 
CTGTTCCAGT 
GAACCAATTG 
CCATCAAGCT 
TAGAAACAGG 
GGTAAAGTTG 
GATGGAACTT 
TTTCCGGTGT 
ATGAAATATT 
GAATGAGCCG 
TAGCAGAAGG 
AACATCTATC 
TCGTATGCCA 
GTGCTTTACA 



TGCTTTAAAT 
TTGGTGATGG 
AGACGTGGTA 
TGGACATGGC 
ATGAAGCTGG 
CCTCCTGCAT 
TATCAAAGTA 
GTTTATTTGG 
ATCAACAATA 
TGGTGAAAGA 
CTAATGTATT 
CCTGGAAACA 
ATTCCGTGAT 
GTTATACATT 
TCTGCTGTGG 
GGAGCGTATT 



GTAGTAGAAG 
CGTAGTTCGT 
TGGAAGTTAA 
ACACTTGGAC 
TCCAATTGAA 
ATGATGAGTT 
GTTGACCTTA 
CGGTGCAGGT 
TTGCAAAAGA 
ACTCGTGAAG 
GGATAAAGTA 
GATTAAGAGT 
GAAAAGCGTG 
AGCAGGTACA 
GTTATCAGCC 
ACATCTACTA 



CTGGTTTAGT 50 

ACAATTGCTA 100 

GAACACAAAT 150 

GTATCATGAA 2 00 

TATACTGAGA 2 50 

AGCATTAAGT 3 00 

TTTGTCCATT 3 50 

GTTGGTAAAA 400 

ACATAGTGGC 450 

GTAATGACTT 500 

TCATTAGTAT 550 

AGCTCTTAGT 6 00 

ATGTTTTGAT 650 

GAGGTATCGG 700 

AACGCTTGCA 750 

AGACAGGATC 8 00 

805 



30 



2) INFORMATION FOR SEQ ID NO: 301 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Fusobacterium gonidiaformans 

(B) STRAIN: ATCC 25563 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301 

4 5 GACGAATTGC CAAAAATATA CAATGCATTA AAGGTGCAAG TTGGAGAAAA 50 

AGAACTTGTA TTGGAAGTGC AACAACATTT GGGAAATAAT GTTGTGAGAA 10 0 

CAGTAGCGAT GGACTCAACA GATGGATTGC TTCGAGGAAT GGAAGTAATG 150 

GATACCGGAG CACCGATTAC TGTTCCAGTA GGGAAGGCGG TTTTAGGAAG 2 00 

AATATTGAAT GTTTTGGGAG AGCCTGTGGA TCAAAAAGGG CCTGTGGAAA 2 50 

50 CAGAAGAATA TTTACCTATC CATAGAGAAG CACCAAAATT TGAAGAACAA 3 00 

GAAACAGTAA CAGAAATTTT TGAAACAGGA ATTAAAGTCA TAGATTTGTT 3 50 

AGCCCCTTAT ATCAAAGGAG GAAAGACAGG TCTATTCGGT GGAGCCGGAG 400 

TAGGGAAAAC AGTTTTAATT ATGGAATTAA TTAATAACAT TGCAAAGGGC 4 50 

CACGGAGGAA TTTCTGTGTT TGCAGGAGTT GGAGAAAGAA CAAGAGAAGG 500 

55 AAGAGATTTA TACAACGAAA TGACAGAGTC CGGAGTTTTG AATAAGACCT 550 

CGTTGGTGTA TGGTCAAATG AATGAGCCGC CCGGAGCAAG ACTTCGTGTG 600 

GCGTTGACAG GATTAACGGT TGCTGAAAAC TTTAGAGATA AAGAAGGGCA 650 

AGATGTATTG TTGTTTATCG ACAATATCTT CCGTTTCACA CAAGCAGGAT 7 00 

CAGAAGTATC GGCTCTATTG GGAAGAATTC CATCGGCAGT AGGATATCAA 750 

50 CCGAACTTAG CGACAGAAAT GGGAACTTTA CAAGAAAGAA TTACTTCTAC 80 0 

180 
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AAAATCAGGA TCTATCACTT CGGTA 
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5 2) INFORMATION FOR SEQ ID NO: 302 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 06 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium necrophorum subsp. 
n e croph or urn 

(B) STRAIN: ATCC 25286 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302 

ACAATGCATT AAAGGTACAG GTGGGAGAAA GGGAACTTGT GTTGGAAGTG 50 

CAGCAACATT TAGGAAATAA TGTTGTCAGA ACAGTAGCAA TGGATTCAAC 100 

AGACGGATTA CTTCGGGGAA TGGAAGTGAG AGATACAGGA GTTCCCATTA 150 

2 5 CTGTTCCGGT AGGAAAGGCG GTTTTGGGAA GAATATTAAA TGTCTTAGGG 2 00 

GAGCCTGTGG ACGAAAAAGG TCCGATAGAG ACAGAAGAAT ATTTACCAAT 2 50 

ACATAGAGAA GCACCGAAAT TTGAAGAACA GGAAACGGTG ACAGAAATTT 300 

TTGAAACAGG AATTAAAGTC ATTGATTTGT TAGCTCCTTA TATTAAAGGA 3 50 

GGAAAAACAG GCCTATTCGG AGGAGCCGGA GTAGGAAAAA CCGTTTTGAT 4 00 

3 0 TATGGAACTG ATCAATAATA TTGCAAAAGG TCATGGAGGA ATTTCTGTTT 4 50 

TTGCAGGAGT TGGAGAAAGA ACGAGAGAGG GAAGAGATCT ATACAACGAA 500 

ATGACAGAGT CCGGAGTTTT GAATAAAACT TCTTTGGTAT ATGGGCAAAT 550 

GAATGAGCCG CCCGGAGCAA GACTTCGAGT GGCTTTAACC GGACTTACTG 600 

TTGCCGAAAA TTTCAGAGAT AAAGAGGGAC AGGATGTCTT ATTGTTCATT 650 

3 5 GACAATATTT TCCGTTTCAC ACAAGCAGGT TCGGAAGTAT CGGCACTTTT 700 

GGGGAGAATT CCTTCTGCAG TGGGATATCA ACCGAACTTG GCGACAGAAA 750 

TGGGAAGCTT ACAAGAAAGA ATTACTTCTA CAAAATCCGG TTCTATCACT 800 

TCCGTG 806 



40 



2) INFORMATION FOR SEQ ID NO: 303 

( i ) SEQUENCE CHARACTERI ST ICS : 
4 5 (A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusobacterium nucleatum subsp, polymorphum 

(B) STRAIN: ATCC 10953 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 03 

GATGAATTGC CTGCAATATA TAATGCTTTA AAAGTAAAAT TAGAAGATAA 50 

GGAACTTGTT CTAGAAGTTG AACAACATCT TGGTAACAAT GTTGTAAGAA 100 

CTGTTGCTAT GGATTCAACT GATGGATTAA AAAGAGGAAT GGAAGTTATA 150 

181 
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GATACAGGTA AACCAATTAC AGTACCAGTT 
AATATTAAAT GTTTTAGGAG AACCTGTTGA 
CTGAAACATT TTTACCTATT CATAGAGAAG 
GAAACTGAAA CTGAAATATT TGAAACAGGA 
5 AGCACCATAT ATTAAAGGTG GAAAAATAGG 
TAGGAAAAAC AGTTTTAATA ATGGAACTTA 
CATGGAGGAA TTTCAGTTTT TGCAGGAGTT 
TAGAGACTTA TATGGTGAAA TGACTGAATC 
CTCTTGTTTA TGGACAAATG AATGAGCCAC 
10 GCATTAACAG GGCTTACTGT TGCAGAAAAC 
AGATGTTCTT CTATTTATAG ATAATATATT 
CAGAAGTTTC AGCTTTACTT GGAAGAATAC 
CCAAACCTAG CAACTGAAAT GGGTGCTTTA 
AAAATCTGGT TCAATTACAT C 

15 



PCT/CAOO/01150 

GGTAAAGCTG TTCTTGGTAG 2 00 

TAATCAAGGT CCTATAAATG 2 50 

CACCAGAATT TGATGACTTA 300 

ATAAAAGTTA TAGACTTATT 350 

ATTATTTGGT GGAGCTGGAG 4 00 

TCAACAACAT TGCAAAAGGA 4 50 

GGAGAAAGAA CAAGAGAAGG 5 00 

AGGAGTTATC ACAAAAACAG 550 

CTGGAGCAAG ACTTAGAGTT 600 

TTTAGAGATA AAGATGGGCA 650 

TAGATTTACA CAAGCAGGTT 700 

CATCAGCTGT TGGATATCAA 750 

CAAGAAAGAA TAACATCTAC 8 00 

821 



2) INFORMATION FOR SEQ ID NO: 304 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 864 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gardnerella vaginalis 
30 (B) STRAIN: ATCC 49145 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304 

TTCCCAGTTG GCTATCTTCC AGATATTTAT AATGCTCTCA AGGTTGATAT 50 

35 CAACACCGTT GGAAACACGG AGGGAGATAC CGTCCACGAG ATTACATTGG 10 0 

AAGTTGAGCA GCACCTTGGT GATTCAACTG TGCGAGCAGT GGCACTTAAG 150 

CCTACGGACG GCTTGGTCCG TGGTGCTTTA GTGCGAGATA CTGGTGGCCC 200 

AATTTCTGTG CCTGTTGGAG ATGTTACAAA AGGTCACGTT TTTGACGTAA 2 50 

CTGGTAACAT TTTAAACGCT AAACCAGGCG AAAACATTGA GGTGACCGAG 300 

40 CGCTGGCCAA TCCACCGCAA CCCACCTGCT TTCGATCAGC TTGAGTCTAA 350 

GACTCAAATG TTTGAAACAG GCATTAAGGT TATCGATTTG CTTACGCCTT 4 00 

ACGTTCAGGG CGGAAAGATT GGTCTGTTCG GTGGTGCAGG CGTTGGTAAA 450 

ACTGTGTTGA TTCAGGAGAT GATTCAGCGC GTTGCACAGA ACCACGGCGG 500 

TGTGTCTGTG TTTGCTGGCG TTGGCGAACG TACTCGTGAG GGTAACGATT 550 

45 TGATTGGCGA AATGGCTGAG GCTGGCGTTT TGGAGAAAAC AGCGCTTGTC 600 

TTTGGTCAGA TGGATGAGCC TCCTGGGACT CGTCTTCGTG TGCCTCTTAC 650 

TGCTTTGACT ATGGCTGAGT ATTTCCGTGA TGTTCAGAAT CAGGATGTGT 700 

TGCTGTTTAT CGACAACATC TTCCGCTTTA CTCAGGCAGG TTCTGAGGTT 750 

TCCACGTTGC TTGGTCGTAT GCCTTCTGCA GTTGGTTATC AGCCAAACTT 800 

50 GGCGGATGAA ATGGGTGCGT TGCAGGAGCG CATTACTTCT ACGCGCGGTC 8 50 

ATTCTATTAC GTCG 864 



55 2) INFORMATION FOR SEQ ID NO: 305 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 848 bases 

(B) TYPE: Nucleic acid 
SO (C) STRANDEDNESS: Double 

182 
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LO 



15 



20 



25 



WO 01/23604 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Gemella haemolysans 

(B) STRAIN: ATCC 10379 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305 



PCT/CAOO/01150 



TCGAATCAGG 
GAAAAAGGCG 
AATTGGTGAT 
TAAATAGGGG 
GTAGGTAACT 
TGACCACGGT 
AAGAAGCTCC 
ACAGGTATTA 
AATCGGTCTT 
AACTTATCAA 
GGTGTAGGTG 
AGATTCTGGT 
AACCACCAGG 
GAATACTTCC 
CATTTTCCGT 
GTATGCCATC 
CGTTTACAAG 



GCATATGCCA 
ATGGGAAAAA 
AACGTAGTAA 
AGCAGAAGTA 
ACACATTAGG 
GAAGAAGCAG 
AACATTCGAT 
AAGTTATCGA 
TTCGGTGGTG 
CAACGTTGCG 
AGCGTACTCG 
GTTATTAACA 
TGCTCGTATG 
GTGATGAAGA 
TTCACACAAG 
AGCCGTTGGT 
AACGTATAAC 



AATCTATTAA 
AGAAAAATTA 
GAACAATCGC 
GTAGATACAG 
TCGTGTGTTC 
GAGCAGAAGT 
GAATTATCAA 
CTTACTTGCA 
CGGGAGTTGG 
CAACAACACG 
TGAAGGAAAT 
AAACAGCCAT 
CGTGTAGCAT 
AGGACAAGAC 
CAGGTTCTGA 
TACCAACCAA 
ATCAACTAAA 



ACGCTTTAGA 
GTTCTTGAAG 
TATGTCATCT 
GAGCACCAAT 
AACGTATTAG 
TCGTAAAGAT 
CTCACGTTGA 
CCATATATTA 
TAAAACGGTT 
GTGGATTATC 
GACTTATACT 
GGTATTCGGA 
TAACAGGATT 
GTGCTTCTAT 
GGTTTCTGCG 
CACTTGCTAC 
AAAGGTTCTG 



AGTTTACATA 50 

TTTCTCTTGA 100 

ACTGATGGAT 150 

TACAGTTCCT 2 00 

GTGAAGCAGT 2 50 

TCAATTCACA 300 

GGTTCTTGAA 350 

AAGGTGGTAA 400 

CTTATCCAAG 450 

AGTATTCACA 500 

ATGAAATGAA 550 

CAAATGAACG 600 

AACAATGGCG 650 

TCATCGATAA 700 

TTATTAGGAC 750 

AGAGATGGGA 8 00 

TTACATCT 848 



30 



ao 



2) INFORMATION FOR SEQ ID NO: 306 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 84 8 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gemella morbillorum 

(B) STRAIN: ATCC 27824 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306 

TCGAATCAGG GCATATGCCT AATCTACTAA ACGCTTTAGA AGTTTATATA 50 

GAAAAAGGCG ATGGAAAAAA AGAAAAATTA GTTCTTGAAG TTTCTCTTGA 100 

AATCGGGGAT AATGTCGTAA GAACTATTGC GATGTCATCT ACTGATGGAT 150 

50 TAAACAGAGG GGCAGAAGTA GTTGATACTG GAGCGCCAAT TACAGTGCCA 2 00 

GTAGGTAACT ATACATTAGG ACGTGTGTTC AACGTATTAG GTGAAGCAGT 2 50 

TGACCACGGA GAAGAAGCTG GAGCAGAAGT TCAAAAAGAA TCTATTCATA 3 00 

AAGAAGCTCC AACTTTCGAA GAATTATCAA CACATGTTGA GGTATTAGAA 3 50 

ACAGGTATTA AAGTTATCGA CCTTCTTGCA CCATATATTA AAGGTGGTAA 4 00 

55 GATTGGACTA TTCGGTGGTG CTGGAGTTGG GAAAACAGTT CTTATCCAAG 4 50 

AACTTATTAA CAACGTAGCA CAACAACACG GAGGACTTTC AGTATTTACT 500 

GGGGTAGGTG AACGTACTCG TGAGGGTAAC GACTTGTACT ATGAAATGAA 550 

AGACTCTGGA GTTATTAATA AAACTGCCAT GGTATTTGGT CAAATGAATG 600 

AGCCACCAGG TGCACGTATG CGTGTTGCCT TAACAGGATT AACAATGGCA 650 

50 GAGTACTTCC GTGATGAAGA AGGACAAGAC GTACTATTAT TTATCGATAA 700 

183 
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PCT/CA00/01150 



TATCTTCCGT TTCACACAAG CAGGGTCTGA 
GTATGCCTTC AGCCGTTGGA TATCAACCAA 
CGTCTTCAAG AACGTATTAC ATCAACTAAA 



GGTATCTGCA 
CTCTTGCAAC 
AAAGGATCTG 



TTATTAGGGC 
AGAAATGGGA 
TTACATCT 



750 
800 
848 



2) INFORMATION FOR SEQ ID NO: 307 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus ducreyi 

(B) STRAIN: DSM 8925 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307 

GATGCAGTAC CAAAAGTATA TGATGCTTTA AAAGTTGAAT CAGGTTTAAC 50 

CTTAGAAGTT CAACAACAAT TAGGTGGTGG TTTAGTACGT TGTATCGCAT 100 

TAGGTACCTC AGATGGTTTA AAGCGTAGCT TAAAGGTTGT AAATACAGGT 150 

AACCCTATTC AAGTTCCTGT AGGCACTAAA ACATTAGGCC GTATTATGAA 2 00 

TGTATTAGGC GAACCAATTG ATGAAAAAGG ACCTATTAGC GAAGAAGCTC 250 

GTTGGGATAT TCATCGTGCG GCTCCAAATT ATGAAGAACA GTCAAATAGT 3 00 

ACTGAATTAC TTGAAACCGG TAT C AAAGTT ATTGACTTAA TTTGTCCATT 3 50 

TGCAAAAGGT GGTAAAGTCG GCTTATTTGG TGGAGCTGGT GTAGGTAAAA 4 00 

CCGTTAATAT GATGGAATTG ATCCGTAATA TTGCTATTGA GCACTCAGGT 4 50 

TATTCGGTTT TTGCTGGTGT AGGTGAGCGT ACTCGTGAAG GTAATGATTT 500 

TTATCATGAA ATGACGGATT CTAATGTATT AGATAAAGTA TCACTAGTAT 550 

ATGGTCAAAT GAATGAACCA CCAGGTAACC GCCTACGTGT TGCGTTAACA 60 0 

GGTTTAACTA TGGCTGAAAA ATTCCGTGAT GAAGGTCGTG ATGTATTATT 650 

TTTCGTAGAT AATATTTATC GTTATACTTT AGCCGGTACA GAAGTTTCTG 700 

CTTTATTAGG CCGTATGCCA TCAGCGGTAG GTTATCAACC AACCCTTGCA 750 

GAAGAAATGG GTGTATTACA AGAACGTATT ACCTCAACTA AAACTGGTTC 800 

AATCACGGCA GTA 813 



2) INFORMATION FOR SEQ ID NO: 308 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus haemolyticus 

(B) STRAIN: ATCC 333 90 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308 

TGAATTTC C A CAAGATGCAG TGCCAAAAGT TTACGATGCA TTAAAAGTTG 50 
AATCAGGTTT AACACTTGAG GTGCAACAAC AATTAGGTGG CGGTGTGGTA 100 



184 
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CGTTGTATCG CATTAGGTGC TTCTGACGGT TTAAAACGTG GTTTAAAAGT 150 

AGAAAACACG AATGATCCGA TTCAAGTACC GGTAGGCACA AAAACCCTTG 200 

GTCGTATCAT GAATGTATTG GGTGAACCAA TTGACGAACA AGGTCCAATC 250 

GGTGAAGAAG AGCGTTGGGC TATCCATCGT TCTGCACCAA GCTATGAAGA 300 

5 ACAATCAAAC AGTACGGAAT TATTAGAGAC TGGTATCAAA GTTATCGACT 350 

TAATTTGTCC ATTCGCAAAA GGTGGTAAAG TTGGTCTATT CGGTGGTGCG 400 

GGTGTAGGTA AAACCGTTAA CATGATGGAA TTAATCCGTA ACATCGCGAT 450 

CGAGCACTCA GGTTACTCCG TATTTGCGGG TGTAGGTGAA CGTACTCGTG 500 

AAGGTAACGA CTTCTATCAT GAAATGAAAG ATTCTAACGT ATTAGATAAA 550 

LO GTATCTTTGG TTTATGGTCA GATGAATGAG CCACCAGGTA ACCGTTTACG 600 

TGTTGCGTTA ACTGGTTTAA CCATGGCAGA AAAATTCCGC GATGAAGGTC 650 

GTGATGTATT ATTCTTCGTG GATAATATCT ATCGTTATAC CCTTGCTGGT 700 

ACGGAAGTAT CTGCGTTATT AGGTCGTATG CCATCTGCGG TAGGTTACCA 750 

ACCAACTCTT GCTGAAGAAA TGGGTGTGTT ACAAGAACGT ATCACTTCAA 800 

L5 CCAAAACAGG TTCTATTACA TCTGTA 826 



2) INFORMATION FOR SEQ ID NO: 309 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 09 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: Haemophilus parahaemolyticus 

(B) STRAIN: ATCC 10014 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309 

35 GATGCAGTAC CAAAAGTATA TGATGCGTTA AAAGTTGAAT CAGGTTTAAC 50 

GCTTGAAGTT CAACAACAAT TAGGCGGTGG CTTAGTGCGC TGTATCGCAT 100 

TAGGTACGTC TGATGGTTTA AAACGTGGCT TAAAAGTAGA AAATACAGGC 150 

AACCCAATTG AAGTGCCAGT GGGCACTAAA ACCCTTGGTC GTATTATGAA 2 00 

CGTATTGGGT GAGCCGATTG ACGAAAAAGG TCCTATCGGT GAAGAAGCAC 250 

4 0 GCTGGGCAAT CCACCGTGCA GCACCAAGCT ACGAAGAGCA ATCAAATAGC 3 00 

ACGGAATTAC TCGAAACAGG TATCAAAGTT ATCGACTTAA TCTGCCCATT 350 

CGCAAAAGGG GGTAAAGTTG GTTTATTTGG TGGTGCAGGT GTAGGTAAAA 4 00 

CCGTAAATAT GATGGAGTTA ATCCGTAACA TCGCGATCGA ACACTCTGGT 4 50 

TACTCTGTAT TTGCAGGGGT AGGTGAGCGT ACTCGTGAAG GTAATGACTT 500 

4 5 CTACCACGAA ATGACAGACT CTAACGTATT AGATAAAGTA TCGTTAGTGT 550 

ATGGTCAAAT GAACGAACCA CCAGGTAACC GTTTACGCGT AGCTTTAACA 600 

GGCTTAACCA TGGCGGAAAA ATTCCGCGAT GAAGGTCGTG ACGTATTATT 650 

CTTCGTCGAT AACATCTACC GTTATACCCT AGCAGGTACG GAAGTGTCAG 700 

CACTTCTCGG TCGTATGCCA TCTGCGGTAG GTTATCAGCC AACCTTAGCA 750 

50 GAAGAAATGG GTGTATTACA AGAGCGTATC ACTTCAACCA AAACTGGTTC 800 

TATCACCTC 809 



55 2 ) INFORMATION FOR SEQ ID NO: 310 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus parainfluenzas 

(B) STRAIN: ATCC 7901 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310 



CGAATTTCCA CAAGATGCAG TACCAAAAGT TTATGATGCA TTAAAAGTTG 50 

AATCGGGTTT AACCCTTGAA GTTCAACAAC AATTAGGTGG TGGTGTGGTA 100 

CGTTGTATCG CACTGGGAGC TTCTGACGGT TTAAAACGCA GTTTAAGCGT 150 

TGAAAATACC AATAAACCAA TTTCAGTACC GGTTGGTGTA AAAACTCTCG 2 00 

15 GTCGTATTAT GAACGTATTG GGCGAACCGA TTGATGAAAG AGGTCCTATC 250 

GGTGCGGAAG AAGAATGGGC AATTCACCGT TCTACTCCAA GTTATGAAGA 3 00 

ACAGTCCAAC AGTACCGAAT TATTAGAAAC CGGTAT CAAA GTTATCGACT 350 

TAATTTGTCC ATTCGCGAAG GGTGGTAAAG TTGGTTTATT CGGTGGTGCG 4 00 

GGTGTAGGTA AGACCGTAAA TATGATGGAA TTAATCCGTA ATATTGCGAT 450 

20 TGAGCACTCA GGTTACTCCG TATTTGCCGG TGTAGGTGAG CGTACCCGTG 500 

AAGGTAACGA CTTCTACCAT GAAATGACAG AATCTAACGT ATTAGACAAA 550 

GTATCCCTAG TTTACGGACA AATGAATGAG CCGCCGGGTA ACCGTTTACG 600 

TGTTGCTTTA AC CGGTTTAA CCATGGCAGA AAAATTCCGT GACGAAGGTC 650 

GTGATGTATT ATTCTTCGTG GATAACATCT ATCGTTATAC CCTTGCAGGG 700 

2 5 ACTGAAGTAT CGGCACTTTT AGGCCGTATG CCATCAGCGG TAGGTTATCA 750 

GCCGACACTT GCAGAAGAAA TGGGTGTGTT ACAAGAACGT ATTACATCAA 800 

CCAAAACAGG TTCTATTACT TCTG 824 



30 



2) INFORMATION FOR SEQ ID NO: 311 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 811 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hafnia alvei 

(B) STRAIN: ATCC 13337 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311 

GCCGTGCCTA AAGTGTATAA CGCACTTGAG GTGAAAGGCG GTGCCACTAA 50 

ACTGGTACTG GAAGTTCAGC AGCAGCTAGG CGGCGGCGTT GTACGCTGTA 100 

TCGCTATGGG TACTTCTGAC GGTCTGCGTC GCGGACTGGA CGTTGTTGAC 150 

50 CTGGAGCACC CGATTGAAGT CCCAGTAGGT AAAGCGACCT TAGGCCGCAT 2 00 

TATGAACGTA CTGGGTGAGC CAATTGATAT GAAGGGTGAT ATCGGCGAAG 250 

AAGATCGCTG GGCTATTCAC CGTGAAGCTC CAAGCTACGA AGAACTGTCT 3 00 

AACTCGCAAG AACTGCTGGA AACTGGTATC AAGGTAATGG ACCTGATTTG 3 50 

TCCGTTCGCT AAGGGCGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTTG 4 00 

55 GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 450 

TCAGGTTACT CTGTATTTGC CGGCGTGGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTATTGGAC AAAGTATCAC 550 

TGGTTTATGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGCGTTGCG 600 

CTGACCGGTC TGACTATGGC TGAGAAGTTC CGTGACGAAG GTCGTGACGT 650 

60 ACTGCTGTTC ATCGATAACA TCTACCGTTA TACCTTGGCC GGTACCGAAG 700 
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TATCTGCACT GTTGGGTCGT ATGCCTTCTG CGGTAGGTTA TCAGCCAACG 
CTGGCGGAAG AGATGGGTGT TCTGCAAGAA CGTATCACCT CGACCAAAAC 
GGGTTCAATC A 



750 
800 
811 



2) INFORMATION FOR SEQ ID NO: 312 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kingella kingae 

(B) STRAIN: ATCC 23330 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312 

GCGATGCTAT TCCACGCGTT TACGATGCGT TGAAACTGGT TGATGTGGAC 50 

TTGACATTAG AAGTGCAACA ACAACTGGGC GATGGCGTAG TGCGTACCAT 100 

TGCGATGGGT AGTACCGATG GTTTGAAACG CGGCTTAGCC GTGAACAACA 150 

CAGGCGCACC TATTACAGTG CCTGTTGGTA AAGCAACATT GGGTCGTATT 2 00 

ATGGACGTAT TGGGTAATCC TGTTGATGAA GCAGGTCCAA TTGGTTCTGA 2 50 

CCAAACGCGT GCTATTCACC AACCAGCTCC TAAATTTGAT GAACTGTCTA 3 00 

GCGCAACCGA ATTGCTGGAA ACAGGCATCA AAGTGATTGA CTTGCTTTGC 3 50 

CCATTTGCAA AAGGTGGTAA AGTAGGTTTG TTTGGTGGTG CAGGTGTGGG 4 00 

CAAAACTGTG AACATGATGG AGTTGATTAA CAACATTGCC AAAGCGCACA 450 

GTGGTTTGTC TGTATTTGCA GGCGTGGGTG AACGTACTCG CGAAGGTAAT 50 0 

GACTTCTATC ACGAGATGAA AGATTCTAAC GTGTTGGATA AAGTTGCCAT 550 

GGTGTATGGT CAAATGAATG AACCTCCTGG CAACCGTTTG CGCGTTGCAT 60 0 

TGACTGGTTT GTCTATGGCA GAACACTTCC GTGATGAAAA AGACGAAAAT 650 

GGCAAAGGTC GCGATGTATT GTTCTTTGTG GACAACATCT ATCGCTACAC 7 00 

ATTGGCAGGT ACAGAAGTAT CGGCATTGCT GGGTCGTATG CCCTCTGCGG 750 

TAGGTTATCA ACCAACATTG GCAGAAGAAA TGGGTCGTTT GCAAGAGCGT 800 

ATTACTTCAA CGCAAACAGG TTCGATTACT T 831 



2) INFORMATION FOR SEQ ID NO: 313 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella, pneumoniae subsp . ozaenae 

(B) STRAIN: ATCC 112 96 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313 

ATGCCGTACC ACGCGTGTAC GAAGCCCTTG AGGTACAGAA TGGTAATGAA 50 
GTTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGTA TCGTACGTAC 100 
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CATCGCCATG GGTTCTTCTG ATGGTCTGCG CCGCGGTCTG GATGTAAAaG 15 0 

ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC GCTGGGTCGT 2 00 

ATCATGAACG TAC TGGGTCA ACCGGTTGAC ATGAAAGGCG ACATCGGCGA 250 

AGAAGAGCGT TGGGCTATCC ACCGCGCGGC ACCGTCCTAT GAAGAGCTGT 3 00 

5 CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CCAAGGGCGG TAAAGTTGGT CTGTTCGGCG GTGCGGGTGT 4 00 

AGGTAAAACT GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 4 50 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAGCGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ATAAAGTATC 550 

LO CCTGGTGTAC GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CGCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG CCGGTACTGA 700 

AGTATCCGCG CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

L5 ACCGGTTCTA TC 812 



20 



2) INFORMATION FOR SEQ ID NO: 314 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
3 0 (A) ORGANISM: Klebsiella, ornithinolytica 

(B) STRAIN: ATCC 31898 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314 

3 5 ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTACAGAA TGGTAATGAG 50 

AGCCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGTGGTA TCGTACGTGC 100 

TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG GAAGTTAAAG 150 

ACCTTGAGCA CCCGATCGAA GTCCCGGTTG GTAAAGCAAC GCTGGGTCGT 2 00 

ATCATGAACG TGCTGGGTCA GCCAATCGAT ATGAAAGGCG ACATCGGCGA 2 50 

10 AGAAGAGCGT TGGGCTATTC ACCGTGCAGC TCCGTCCTAT GAAGAGCTGT 3 00 

CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT CGACCTGATG 3 50 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 4 00 

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

15 AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTCTGG ATAAAGTATC 550 

CCTGGTTTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG CCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG CCGGTACTGA 700 

AGTATCCGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

50 CCCTGGCGGA AGAGATGGGT GTTCTGCAGG AACGTATCAC CTCCACCAAA 800 

ACCGGTTCTA TC 812 



55 2) INFORMATION FOR SEQ ID NO: 315 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 
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WO 01/23604 PCT/CA00/01150 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315 



LO 



GTACCGCGCG TGTACGAGGC TCTTGAGGTA CAAAATGGTA GTGAGAATCT 50 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATTGTT CGTAC CATCG 100 

CCATGGGTTC TTCCGACGGT CTGCGTCGCG GTCTGGAAGT CAAAGACCTC 150 

GAGCATCCGA TCGAAGTCCC GGTAGGTAAA GCAACGCTGG GTCGTATCAT 200 

L5 GAACGTACTG GGCCAACCGG TAGACATGAA AGGCGACATC GGCGAAGAAG 2 50 

AGCGTTGGGC GATTCACCGC GCAGCGCCTT CCTACGAAGA GTTGTCAAAC 3 00 

TCTCAGGAAC TGCTGGAAAC CGGCATCAAA GTTATCGACC TGATGTGTCC 3 50 

GTTTGCGAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 40 0 

AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT CGAGCACTCC 450 

?0 GGTTACTCCG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 500 

CTTCTACCAC GAAATGACCG ACTCCAACGT TATCGATAAA GTATCCCTGG 550 

TGTATGGCCA GATGAACGAG CCGCCGGGAA ACCGTCTGCG CGTTGCGCTG 600 

ACCGGCCTGA CCATGGCTGA GAAGTTCCGT GACGAAGGTC GTGACGTTCT 650 

GCTGTTCGTC GATAACATCT ATCGTTACAC CCTGGCCGGT ACTGAAGTAT 700 

?5 CCGCACTGCT GGGTCGTATG CCTTCAGCGG TAGGTTACCA GCCGACTCTG 750 

GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACGGG 800 

TTCTATCACT TCC 813 



30 



40 



2 ) INFORMATION FOR SEQ ID NO: 316 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 822 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316 

GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTACAGA ATGGTAATGA 50 

GAGCCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGTGGT ATCGTACGTG 100 

CTATCGCCAT GGGTTCTTCT GACGGTCTGC GTCGTGGTCT GGAAGTTAAA 150 

50 GACCTTGAGC ACC CGATCGA AGTCCCGGTT GGTAAAGCAA CGCTGGGTCG 2 00 

TATCATGAAC GTGCTGGGTC AGCCGATCGA TATGAAAGGC GACATCGGCG 250 

AAGAAGAGCG TTGGGCTATT CACCGCGCAG CTCCGTCTTA TGAAGAGCTG 300 

TCCAGTTCTC AGGAACTGCT GGAAACCGGC ATCAAAGTTA TCGACCTGAT 3 50 

GTGTCCGTTC GCTAAGGGCG GTAAAGTAGG TCTGTTCGGT GGTGCGGGCG 4 00 

55 TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 4 50 

CACTCCGGTT ACTCCGTGTT TGCGGGCGTC GGTGAACGTA CTCGTGAGGG 500 

TAACGACTTC TAC CACGAAA TGACCGACTC CAACGTTCTG GATAAAGTAT 550 

CCCTGGTTTA TGGCCAGATG AACGAGCCGC CGGGAAACCG TCTGCGCGTT 600 

GCTCTGACCG GCCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGTCGTGA 650 

50 CGTTCTGCTG TTCGTCGATA ACATCTATCG TTATACCCTG GCCGGTACTG 700 
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AAGTATCCGC ACTGCTGGGT CGTATGCCTT 
ACCCTGGCGG AAGAGATGGG TGTTCTGCAG 
AACCGGTTCT ATCACTTCCG TA 



PCT/CA00/01150 

CAGCGGTAGG TTATCAGCCG 750 
GAACGTATCA CCTCCACCAA 800 

822 



5 



2) INFORMATION FOR SEQ ID NO: 317 

(i) SEQUENCE CHARACTERISTICS: 
L0 (A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



L5 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317 



AGAATGGTAA TGAAGTTCTG GTGCTGGAAG TTCAGCAGCA GCTGGGCGGC 50 

GGTATCGTAC GTACCATCGC CATGGGTTCT TCTGATGGTC TGCGCCGCGG 10 0 

IS TCTGGATGTA AAAGACCTCG AGCACCCGAT CGAAGTCCCG GTAGGTAAAG 150 

CAACGCTGGG TCGTATCATG AACGTACTGG GTCAACCGGT TGACATGAAA 2 00 

GGCGACATCG GCGAAGAAGA GCGTTGGGCT ATCCACCGCG CGGCACCGTC 2 50 

CTATGAAGAG CTGTCCAGCT CTCAGGAACT GCTGGAAACC GGCATCAAAG 3 00 

TTATCGACCT GATGTGTCCG TTCGCCAAGG GCGGTAAAGT TGGTCTGTTC 3 50 

3 0 GGCGGTGCGG GTGTAGGTAA AACTGTAAAC ATGATGGAGC TGATCCGTAA 4 00 

CATCGCGATC GAGCACTCCG GTTACTCTGT GTTTGCGGGC GTAGGTGAGC 4 50 

GTACTCGTGA GGGTAATGAC TTCTACCACG AAATGACCGA CTCCAACGTT 500 

ATCGATAAAG TATCCCTGGT GTACGGCCAG ATGAACGAGC CGCCGGGAAA 550 

CCGTCTGCGC GTTGCGCTGA CCGGCCTGAC CATGGCTGAG AAATTCCGTG 600 

3 5 ACGAAGGTCG TGACGTACTG CTGTTCGTCG ATAACATCTA TCGTTACACC 650 

CTGGCCGGTA CTGAAGTATC CGCACTGCTG GGTCGTATGC CTTCAGCGGT 700 

AGGTTATCAG CCGACCCTGG CGGAAGAGAT GGGCGTTCTG CAGGAACGTA 7 50 

TCACCTCCAC CAAAACCGGT TCTATCACCT CCGTA 785 

10 

2 ) INFORMATION FOR SEQ ID NO: 318 

( i) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 759 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) . TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 318 



CTGGTGCTGG AAGTTCAGCA GCAGCTCGGC GGCGGTATCG TACGTWC CAT 50 

CGCTATGGGT TCTTCCGACG GTCTGCGTCG CGGTCTGGAT GTTAAAGATC 100 

50 TCGAGCACCC AATCGAAGTT CCGGTMGGTA AAGCAACMCT GGGTCGTATC 150 
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ATGAACGTAC 
AGAGCGTTGG 
GCTCTCAGGA 
CCGTTCGCTA 
TAAAACCGTA 
CCGGTTACTC 
GACTTCTACC 
GGTATATGGC 
TGACCGGTCT 
CTGCTGTTCG 
ATCTGCWCTG 
TGGCGGAAGA 
GGTTCTATC 



TGGGTCAKCC 
GCTATCCACC 
ATTGCTGGAA 
AGGGCGGTAA 
AACATGATGG 
CGTGTTTGCG 
ACGAAATGAC 
CAGATGAACG 
GACCATGGCT 
TCGATAACAT 
CTGGGTCGTA 
GATGGGCGTT 



AGTMGACATG 
GCGCTGCACC 
ACCGGTATCA 
AGTCGGTCTG 
AGCTGATCCG 
GGCGTAGGTG 
CGACTCCAAC 
AGCCACCGGG 
GAGAAATTCC 
CTATCGTTAC 
TGCCTTCAGC 
CTGCAGGAAC 



AAAGGCGACA TCGGTGAA<5A 
TTCCTACGAA GAGCTGTCTA 
AAGTTATCGA CCTGATGTGT 
TTCGGTGGTG CSGGTGTTGG 
TAACATCGCG ATCGAGCACT 
AACGTACTCG TGAGGGTAAC 
GTTATCGATA AAGTATCCCT 
AAACCGTCTG CGCGTTGCTC 
GTGACGAAGG TCGTGACGTA 
ACCCTGGCCG GTACTGAAGT 
GGTAGGTTAC CAGCCGACCC 
GTATCACCTC CACCAAGACC 



200' 

250 

300 

350 

400 

450 

500 

550 

600 

650 

700 

750 

759 



2) INFORMATION FOR SEQ ID NO: 319 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera cryocrescens 

(B) STRAIN: ATCC 33435 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319 

TTCCCTCAGG ATGCCGTACC GCGTGTGTAC GAAGCCCTTG AGGTTCAGAA 50 

TGGTAATGAA GTGCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 100 

TCGTACGTAC CATCGCTATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GATGTAAAAG ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 2 00 

ACTGGGTCGT ATCATGAACG TACTGGGCCA ACCGGTAGAC ATGAAAGGCG 2 50 

ACATCGGTGA AGAAGAACGT TGGGCTATCC ACCGTGCAGC ACCTTCCTAC 3 00 

GAAGAGCTGT CAAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTTG CGAAGGGCGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT TCGTAACATC 4 50 

GCGATTGAGC ACTCCGGTTA TTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTTTC CCTGGTTTAC GGCCAGATGA ACGAGCCACC AGGAAACCGT 600 

CTGCGCGTTG CGCTGACTGG TCTGACTATG GCTGAGAAGT TCCGTGACGA 650 

AGGTCGCGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACAGA AGTATCTGCA CTGCTGGGTC GTATGCCTTC AGCGGTAGGT 750 

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 8 00 

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 320 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgiana 
5 (B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 32 0 

GCCGTACCGC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 

L0 GCTGGTGCTG GAAGTTCAGC AGCAGCTCGG TGGCGGTATC GTGCGTACCA 100 

TCGCCATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA AGTTAAAGAT 150 

CTCGAGCACC CGATCGAAGT TCCGGTAGGT AAAGCAACAC TGGGTCGTAT 2 00 

CATGAACGTA CTGGGTCACC CGGTAGACAT GAAAGGCGAC ATCGGTGAAG 2 50 

AAGAGCGTTG GGCTATCCAC CGCGCTGCGC CTTCCTACGA AGAGCTGTCC 3 00 

L5 AGCTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGCGGT GCGGGTGTTG 4 00 

GTAAAACCGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 50 0 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

20 TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCG 60 0 

CTGACCGGCC TGACCATGGC TGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACTGAAG 70 0 

TATCTGCACT GCTGGGTCGT ATGCCTTCAG CGGTAGGTTA CCAGCCGACT 7 50 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800 

25 CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO : 321 



30 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 0 (A) ORGANISM: Lactobacillus acidophilus 

(B) STRAIN: ATCC 4356 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321 

*5 TCGATAAGAA TTTACCTGAT ATTAACAACG CCTTACGTGT AATCAAGTCC 50 

GAAGATGAAA GCATCGTTCT TGAAGTTACA CTTGAACfCG GTGATGGTGT 100 

TTTAAGAACA ATCGCCATGG AATCTACCGA TGGTCTTCGT CGTGGTATGA 150 

AAGTCGAAGA TACTGGCGCT CCAATTTCAG TTCCAGTTGG AGAAGACACT 2 00 

TTAGGTCGTG TGTTTAACGT TTTAGGACAG CCTATTGATG GTGGTCCAGC 250 

50 CTTTCCAAAG GATCACCCAC GTGAGGGTAT CCACAAGGAA GCACCTAAAT 3 00 

ATGAAGATTT AACTACTAGT CGTGAAATTC TTGAAACTGG TATCAAGGTT 350 

ATCGACCTTC TTGAACCATA TGTTCGTGGT GGTAAAGTTG GTTTGTTTGG 4 00 

TGGTGCCGGT GTTGGTAAAA CTACTATTAT TCAAGAATTA ATTCACAACA 4 50 

TCGCTCAAGA ACACGGTGGT ATTTCCGTAT TTACTGGTGT TGGTGAAAGA 500 

55 ACTCGTGAAG GTAATGACCT TTACTTTGAA ATGAAAGCTT CAGGCGTTTT 550 

AAGTAAGACT GCCATGGTAT TTGGTCAGAT GAACGAGCCG CCTGGTGCCA 600 

GAATGCGTGT TGCATTAACC GGTTTGACAC TTGCTGAATA CTTTAGAGAT 650 

GTTGAAGGTC AAGACGTATT GCTCTTTATT GACAATATCT TTAGATTTAC 700 

TCAGGCTGGT TCAGAGGTAT CTGCTTTGCT TGGTCGTATG CCAAGTGCCG 750 

50 TAGGTTATCA GCCAACTTTG GCAACAGAAA TGGGTCAATT GCAGGAAAGA 800 
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5 2) INFORMATION FOR SEQ ID NO: 322 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella pneumophila, subsp . pneumophila 

(B) STRAIN: ATCC 33152 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322 



TTCCTCGTGA TAGCGTGCCT AAAGTCAATG ATGCGTTAAA GCTTGTTGAT 50 

AGTGATCTGG TTTTTGAAGT GCAGCAGCAA CTTGGAGACG GAGTTGTGCG 100 

TACTATTGCC ATGGGAACAA CCGATGGTTT AAAGCGAGGA TTAAAAGCAG 150 

AAAATACAGG CCATCCTATT CAAGTGCCAG TAGGTAAGAA AACTTTGGGA 2 00 

25 CGCATTATGG ATGTTCTTGG GCGTCCTGTA GATGATGCTG GGCCTATCGA 2 50 

TGCTGAAGAG ACTTGGGCTA TTCATCGTAA AGCACCAAGT TATGAAGAGC 3 00 

AAGCTGGCAG CCAGGAATTA TTGGAAACTG GTATTAAAGT AATTGATTTG 3 50 

CTTTGCCCTT TTGCCAAGGG AGGTAAAGTT GGTCTATTCG GTGGTGCCGG 4 00 

TGTAGGCAAA ACCGTTAACA TGATGGAATT AATACGAAAC ATTGCAATTG 4 50 

3 0 AGCATAGCGG TTATTCAGTG TTTGCAGGGG TTGGTGAACG TACCCGTGAA 500 

GGAAACGACT TCTATCATGA GATGAAAGAC TCTAATGTAT TGGATAAAGT 550 

ATCGCTTGTT TATGGTCAGA TGAATGAGCC GCCAGGAAAC CGTTTGCGTG 600 

TTGCTCTAAC CGGTTTGACT ATGGCTGAAA AATTCCGGGA TGAAGGGCGA 650 

GACGTTCTTT TGTTTATCGA TAATATTTAT CGTTATACCT TGGCTGGGGT 700 

3 5 TGAAGTATCT GCGCTGTTAG GCCGTATGCC TTCTGCAGTA GGATATCAGC 750 

CGACATTAGC AGAGGAAATG GGTATGCTGC AAGAGCGCAT TACCTCCACA 8 00 

AAAACAGGTT CTATTACTTC CATA 824 



40 



50 



2) INFORMATION FOR SEQ ID NO: 323 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 818 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leminorella grimontii 

(B) STRAIN: ATCC 33999 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323 

GACGCCGTAC CGAAAGTGTA CGATGCGCTT GAAGTTCAAA TTGATGCCAA 50 

GCTGGTTCTG GAAGTTCAAC AGCAGCTCGG CGGCGGCGTT GTTCGCTGCA 100 

TCGCGATGGG TACTTCAGAC GGCTTAAGCC GCGGTCTGGA CGTGCTCGAT 150 

60 CTGGAACACC CGATTGAAGT ACCGGTGGGC AAAGCGACGC TGGGCCGCAT 2 00 
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CATGAACGTG CTTGGTCACC CTATCGACAT GAAGGGCGAC ATCGGCGAAG ' 2"5<T 

AAGAGCGTTG GGCTATTCAC CGCGCAGCGC CGAGCTACGA AGACCTGTCG 3 00 

GGCGCAACCG AGCTGCTGGA GACCGGCATC AAGGTTATCG ACCTGATTTG 3 50 

TCCGTTCGCC AAGGGCGGTA AAGTCGGCCT GTTCGGCGGC GCCGGCGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTCATTC GCAACATTGC GACCGAGCAC 4 50 

TCCGGTTACT CCGTGTTTGC AGGCGTAGGT GAACGTACCC GTGAGGGTAA 500 

CGACTTCTAC CACGAAATGA CTGAATCCAA CGTATTGGAC AAGGTGTCGC 550 

TGGTATACGG TCAGATGAAC GAGCCGCCTG GAAACCGTCT GCGCGTAGCG 600 

TTAACGGGCT TGACCATGGC GGAGAAGTTC CGTGATGAAG GCCGTGACGT 650 

TCTGCTGTTT AT CGAC AACA TTTACCGCTA TACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCAG CCGTAGGCTA CCAGCCGACT 750 

CTGGCTGAGG AAATGGGCGT GCTTCAAGAG CGTATTACCT CTACCAAGAC 800 

GGGGTCTATC ACCTCCGT 818 



2) INFORMATION FOR SEQ ID NO: 324 

( i ) SEQUENCE CHARACTER I STICS : 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria, monocytogenes 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324 

GTGGAAACTT ACCTGAAATC TACAATGCCC TAGTTATTGA ATATAAATCT 50 

GATGCAGAAG AAGCACCAAC TAGCCAACTT ACTTTAGAAG TAGCCATCCA 100 

ATTAGGTGAT GATGTTGTAC GTACAATCGC AATGGCATCA ACAGATGGTG 150 

TTCAAAGAGG TATGGAAGTT ATTGATACTG GGAGCCCAAT TACAGTTCCT 2 00 

GTAGGTACAG TAACTCTTGG TCGTGTATTT AATGTATTAG GAAACACCAT 2 50 

CGATTTGGAC GAACCACTTC CAAGCGATAT TAAACGTAAT AAAATTCACC 3 00 

GCGAAGCTCC AACTTTCGAT CAATTAGCAA CGACAACAGA AATTCTTGAA 3 50 

ACAGGAATTA AAGTAGTTGA CTTGCTAGCT CCTTACTTAA AAGGTGGTAA 4 00 

AATCGGATTG TTCGGTGGTG CCGGTGTTGG TAAAACCGTT CTAATTCAAG 4 50 

AGCTTATCCA TAATATCGCA CAAGAACATG GTGGTATTTC TGTGTTCGCT 500 

GGCGTTGGAG AACGTACTCG TGAAGGTAAC GACCTTTACT TCGAAATGAA 5 50 

AGATTCAGGC GTTATTGAAA AAACAGCGAT GGTATTCGGT CAAATGAACG 600 

AGCCACCAGG TGCGCGTATG CGTGTTGCCT TAACTGGTCT AACAATTGCT 650 

GAATATTTCC GTGATGAAGA ACATCAAGAT GTACTTTTAT TCATTGATAA 7 00 

CATTTTCCGT TTCACACAAG CTGGTTCAGA GGTTTCGGCT TTACTAGGTC 750 

GTATGCCATC TGCGGTAGGT TACCAACCAA CCCTAGCTAC TGAAATGGGT 800 

CAACTACAAG AACGTATTAC ATCTACTAAC GTTGG 835 



2) INFORMATION FOR SEQ ID NO: 32 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Micrococcus lylae 

(B) STRAIN: ATCC 27566 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325 

CCCGCGTGGC GAGTTGCCGG CACTGTTCAA CGCGCTGACT GTCGAGGTCA 50 

CCCTCGAAGC AGTCGCTAAA ACCATTACCC TTGAGGTTGC TCAGCACCTC 100 

10 GGCGACAACT TGGTTCGTGC CGTGTCCATG GCACCGACCG ACGGTCTCGT 150 

CCGTGGCGCT GCTGTGATCG ACAGCGGTAA GCCGATCTCA GTTCCCGTTG 2 00 

GTGACGTAGT CAAGGGACAC GTCTTCAACG CTCTGGGTGA TTGCCTCGAT 2 50 

GAGCCAGGTC TTGGCCGTGA CGGTGAGCAG TGGGGCATCC ACCGCGATCC 3 00 

GCCACCTTTT GACCAGCTTG AGGGTAAGAC CGAGATTCTG GAAACCGGTA 350 

15 TTAAGGT CAT CGACCTGCTG ACCCCGTATG TTAAGGGCGG CAAGATCGGC 4 00 

CTGTTCGGTG GTGCTGGTGT GGGTAAGACC GTTCTTATCC AGGAAATGAT 4 50 

CACCCGTATC GCTCGCGAGT TCTCCGGTAC CTCGGTGTTC GCAGGCGTGG 50 0 
GTGAGCGTAC CCGTGAGGGC ACCGACCTCT TCCTGGAAAT GGAAGAGATG , 550 

GGCGTTCTCC AGGACACCGC TCTTGTGTTC GGCCAGATGG ACGAGCCTCC 600 

2 0 AGGAGTTCGT ATGCGCGTGG CGCTGTCCGG CCTGACCATG GCGGAGTACT 650 

TCCGCGATGT GCAGCACCAG GACGTGCTTC TGTTCATCGA CAACATCTTC 7 00 

CGTTTCACCC AGGCAGGTTC CGAGGTTTCC ACCCTCCTAG GCCGCATGCC 750 

TTCTGCCGTG GGTTACCAGC CAACGCTGGC AGACGAGATG GGTGTTCTGC 800 

AGGAGCGTAT TACCTCCACA AAGGGTAA 82 8 

25 



2) INFORMATION FOR SEQ ID NO: 32 6 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moellerella wisconsensis 
40 (B) STRAIN: ATCC 35017 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326 

GATGCCGTAC CAAAAGTGTA CGATGCTCTT GAGGTTCTTA ACGGTAAAGA 50 

4 5 AAAATTGGTG CTGGAAGTTC AGCAACAATT AGGCGGTGGT GTTGTTCGTT 100 

GTATCGCAAT GGGTACATCA GATGGTTTAA GCCGCGGTTT AGAAGTTAAA 150 

AATACAGATC ATCCGATCGA AGTTCCTGTC GGTGTTAAAA CGCTTGGCCG 2 00 

TATCATGAAC GTGCTGGGTG ACCCAATCGA CATGAAAGGT GATATCGGCG 2 50 

AAGAAGAACG CTGGTCAATT CACCGCGCAG CACCAAGCTA TGAAGATCTG 3 00 

50 GCTAACTCAA CAGAACTTCT AGAAACAGGT ATCAAAGTTA TGGACCTGAT 350 

TTGCCCATTC GCTAAAGGGG GTAAAGTGGG TCTGTTCGGT GGTGCGGGTG 400 

TCGGTAAAAC AGTTAACATG ATGGAGCTTA TTCGTAATAT CGCGATTGAG 4 50 

CACTCAGGTT ATTCTGTATT CGCGGGTGTT GGTGAACGTA CTCGTGAAGG 500 

TAACGATTTC TACCATGAAA TGACAGACTC AAACGTTCTG GATAAAGTTT 550 

55 CATTGGTTTA TGGCCAGATG AATGAGCCAC CAGGAAACCG TCTGCGTGTT 600 

GCTCTGACTG GTCTGACTAT GGCAGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTACTGTTA TTCGTAGATA ATATTTATCG TTATACCTTA GCAGGGACAG 700 

AAGTATCTGC ACTGCTGGGT CGTATGCCTT CAGCGGTGGG TTATCAGCCA 750 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCGACTAA 800 

50 GACCGGCTCT ATCACTTCCG TA 822 
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2) INFORMATION FOR SEQ ID NO: 327 

5 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 854 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Branhamella catarrhalls 

(B) STRAIN: ATCC 43628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327 

2 0 CCGTGGCGAT GTCCCCCAAA TCTTTGATGC ACTTCATGTT GATGGTACTG 50 

AAACCACCCT TGAAGTCCAA CAACAGTTAG GTGATGGTGT GGTGCGTACC 100 

ATTGCCATGG GTTCTACCGA AGGCTTAAAG CGTGGCTTGC CTGTCTCTAA 150 

TTCAGGTGCA CCCATTTCGG TACCAGTCGG TCAAGCAACA CTGGGTCGCA 2 00 

TTATGGATGT CCTAGGTCGC CCAATCGATG AAGCAGGTCC GGTAAATGCT 2 50 

25 GAACAAAAAT GGTCCATTCA TCGTGAAGCA CCAAGTTATG ATGAACAGTC 3 00 

AAATAGTACA GAACTTTTAG AAACAGGCAT CAAAGTGATT GATTTGCTTT 3 50 

GTCCATTTGC CAAAGGTGGT AAAGTCGGTC TGTTCGGTGG TGCTGGTGTT 4 00 

GGTAAGACCG TTAACATGAT GGAGCTTATC AATAATATCG CCCTAAAACA 4 50 

CTCAGGTCTG TCGGTTTTTG CTGGTGTGGG TGAGCGTACT CGTGAGGGTA 500 

3 0 ATGACTTCTA CCATGAAATG CAAGAAGCAG GCGTTGTTAA TACCGAAGAT 550 

TTTACTCAGT CAAAAGTTGC CATGGTTTAT GGTCAGATGA ATGAGCCACC 600 

AGGAAACCGT CTGCGTGTTG CCTTAACTGG TTTGACCATG GCAGAGTATT 650 

TCCGTGATGA AAAAGACGAA GCAACGGGCA AAGGCCGTGA TGTTCTGCTG 70 0 

TTCGTTGATA ATATTTATCG TTACACATTG GCAGGTACTG AGGTATCAGC 750 

35 ACTTTTAGGT CGTATGCCAT CTGCGGTAGG TTATCAGCCG ACTTTGGCCG 800 

AAGAGATGGG CTTGCTACAA GAGCGTATCA CCTCCACCCA ATCAGGCTCA 850 

ATTA 854 



40 



2) INFORMATION FOR SEQ ID NO: 328 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 831 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Moraxella osloensis 

(B) STRAIN: ATCC 19976 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328 

CCGTCAAAGC GTACCAAGAA TTTATGATGC CTTAAAAGTT GAAGGCACAG 50 

AAACTACATT AGAAGTACAA CAACAATTGG GTGATGGTAT CGTACGTACT 100 

ATTGCCATGG GTTCTACTGA AGGTCTAAAA CGTGGTCTAC CAGTTAGCAA 150 

SO CACTGGCGCA CCAATCTCTG TACCTGTGGG TAAAGGTACA CTAGGTCGTA 200 
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TCATGGACGT TTTAGGACAC CCAATCGATG AGGCAGGTCC GGTAGA'gCST 35D 

AGTAACACTT GGGCGATTCA CCGTGAAGCG CCAAGCTATG ATGAACAATC 3 00 

AAACTCTACT GAACTTTTAG AAACCGGTAT TAAAGTAATT GACTTACTAT 3 50 

GCCCATTTGC TAAAGGTGGT AAAGTCGGTC TGTTCGGTGG CGCGGGTGTT 4 00 

5 GGTAAAACCG TTAACATGAT GGAACTTATC AATAACATCG CAAAAGCACA 450 

CTCAGGTTTA TCGGTATTTG CTGGTGTAGG TGAGCGTACT CGTGAAGGTA 500 

ATGACTTCTA CCACGAGATG AAAGACTCAA ACGTACTTGA TAAAGTTGCG 550 

ATGGTGTATG GTCAGATGAA TGAGCCACCA GGAAACCGTT TACGTGTTGC 600 

CCTGACAGGT TTAACCATGG CAGAATACTT CCGTGACGAA AAAGATGAAA 650 

10 ACGGTAAAGG TCGTGACGTA TTATTGTTCG TTGACAATAT TTATCGTTAC 700 

ACGCTAGCGG GTACCGAAGT ATCAGCATTA TTAGGTCGTA TGCCATCTGC 750 

AGTAGGGTAT CAGCCAACGC TTGCAGAAGA GATGGGTGTA CTACAAGAAC 8 00 

GTATTACTTC AACCCAATCA GGCTCTATTA C 831 



15 



30 



2) INFORMATION FOR SEQ ID NO: 32 9 



(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Morganella morganii subsp . morganii 

(B) STRAIN: ATCC 25830 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 32 9 



CGAATTTCCT CAGGATGCAG TACCGAAAGT GTACGATGCG CTTGAGGTAA 50 

CAAATGGTAA AGAAAAACTG GTGCTGGAAG TTCAGCAGCA GTTAGGCGGC 100 

3 5 GGGGTTGTCC GTTGTATCGC TATGGGTACA TCTGATGGTC TGAGCCGTAA 150 

TCTGGAAGTA ACCGATTTAG GCCACCCGAT CGAAGTCCCT GTCGGCGTGA 200 

AAACCTTAGG ACGTATCATG AACGTTCTGG GTGATCCGAT CGATATGAAA 250 

GGTGACATCG GCGCAGAAGA AAAATGGTCT ATTCACCGTG CTGCACCAAC 3 00 

ATACGAAGAA CTGTCTAACT CCCAGGAACT GCTGGAAACA GGTATCAAAG 350 

*0 TAATGGACCT GATCTGCCCG TTCGCGAAGG GTGGTAAAGT CGGTCTGTTC 4 00 

GGTGGTGCGG GTGTGGGTAA AACCGTAAAC ATGATGGAAC TGATCCGTAA 4 50 

CATCGCGATC GAGCACTCCG GTT ACT CTGT ATTCGCAGGG GTCGGTGAGC 50 0 

GTACCCGTGA AGGTAACGAC TTCTATCATG AAATGACAGA CTCCAACGTT 550 

CTGGACAAAG TATCACTCGT GTACGGCCAG ATGAACGAGC CACCGGGAAA 600 

£5 CCGTCTGCGC GTTGCTCTGA CCGGTCTGAC CATGGCGGAA AAATTCCGTG 650 

ATGAAGGCCG CGATGTACTG CTGTTCGTTG ATAACATCTA CCGTTATACC 700 

CTGGCCGGTA CTGAAGTATC CGCGCTGTTA GGCCGTATGC CTTCAGCGGT 750 

AGGTTACCAG CCGACACTGG CGGAAGAAAT GGGTGTGCTT CAGGAACGTA 800 

TCACATCGAC CAAAACAGGC TCTATCACGT CTGTA 835 

50 



2) INFORMATION FOR SEQ ID NO: 330 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 
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(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Pantoea agglomerans 
5 (B) STRAIN: ATCC 27155 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330 

GACGCGGTAC CGCAAGTGTA CAGCGCCCTC GAGGTTATGA ATGGTGATGC 50 

10 GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGC GTAGTACGTA 100 

CCATCGCAAT GGGTACGTCT GACGGCCTGA AGCGTGGTCT GAGCGTCAAC 150 

GACCTGCAGA AACCGATTCA GGTACCCGTC GGTAAAGCGA CCCTGGGCCG 2 00 

TATCATGAAC GTTCTCGGCG AGCCAATCGA TATGAAAGGC GAGCTGAAAG 2 50 

AAGAAGATGG CAGCGCAGTA GAGATCGCCT CTATTCACCG CGCAGCCCCT 3 00 

15 TCTTATGAAG ATCAGTCTAA CTCGCAGGAA CTGCTGGAAA CCGGCATCAA 3 50 

GGTTATCGAC CTGATGTGTC CGTTTGCTAA AGGCGGTAAA GTCGGTCTGT 4 00 

TCGGTGGTGC GGGTGTAGGT AAAACCGTCA ACATGATGGA ACTGATCCGT 4 50 

AACATCGCGG CTGAACACTC AGGTTACTCA GTGTTTGCCG GTGTGGGTGA 500 

GCGTACTCGT GAGGGTAACG ACTTCTACCA CGAAATGACT GACTCTAACG 550 

2 0 TTATCGATAA AGTTGCACTG GTCTATGGCC AGATGAACGA GCCGCCGGGT 600 

AACCGTCTGC GCGTAGCACT GACCGGTCTG ACCATGGCGG AAAAATTCCG 650 

TGATGAAGGT CGCGACGTTC TGCTGTTCAT CGATAACATC TACCGTTATA 700 

CCCTGGCCGG TACAGAAGTT TCTGCACTGC TGGGTCGTAT GCCATCTGCG 750 

GTAGGTTACC AGCCAACGCT GGCAGAAGAG ATGGGTGTGT TGCAGGAGCG 800 

2 5 TATTACCTCC ACCAAGACCG GTTC 824 



2) INFORMATION FOR SEQ ID NO: 331 
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( i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
4 0 (A) ORGANISM: Pantoea dispersa 

(B) STRAIN: ATCC 14589 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331 

45 TATACAGCGC TCTCGAGGTA AAAAATGGTG ATGCTCGTCT GGTGCTGGAA 50 

GTACAGCAGC AGCTGGGCGG TGGCGTGGTG CGTACCATCG CCATGGGTTC 100 

TTCTGACGGC CTGAAGCGCG GTCTGGAAGT CACCGACCTG AAAAAACCTA 150 

TCCAGGTTCC GGTTGGTAAA GCAACACTCG GCCGTATCAT GAACGTGCTG 2 00 

GGTGAGCCAA TCGACATGAA AGGCGACCTG AAAGAAGAAG ACGGCAGCGC 2 50 

50 TGTAGAGGTT TCCTCTATTC ATCGCGCAGC GCCTTCTTAT GAAGATCAGT 3 00 

CAAACTCGCA GGAACTGCTG GAAACCGGCA TCAAGGTTAT CGACCTGATG 3 50 

TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG GTGCGGGTGT 4 00 

AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGGCTGAGC 4 50 

ACTCAGGTTA TTCGGTCTTT GCCGGCGTGG GTGAGCGTAC TCGTGAGGGT 500 

55 AACGACTTCT ACCACGAAAT GACGGACTCC AACGTTATCG ATAAAGTAGC 550 

GCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGTAACCGT CTGCGCGTAG 600 

CACTGACCGG TCTGACCATG GCGGAAAAAT TCCGTGATGA AGGCCGTGAC 650 

GTTCTGCTGT TCATCGACAA CATCTACCGT TACACCCTGG CCGGTACAGA 700 

GGTTTCTGCA CTGCTGGGTC GTATGCCATC GGCGGTAGGT TATCAGCCAA 750 

50 CGCTGGCTGA AGAGATGGGT GTGCTGCAGG AGCGTATTAC CTCCACCAAG 800 

198 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
ACCGGTTC 



PCT/CA00/01150 



5 2) INFORMATION FOR SEQ ID NO: 332 

( i ) SEQUENCE CHARACTERI ST I CS : 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 
L0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

L5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pasteurella multocida 

(B) STRAIN: NCTC 10322 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332 



GATGCAGTAC CAAAAGTATA TGATGCCTTA AATGTTGAAA CAGGTTTAGT 50 

ACTTGAAGTT CAACAACAAT TAGGTGGTGG TGTAGTTCGC TGTATCGCAA 100 

TGGGATCATC TGATGGATTA AAACGCGGTT TAAGCGTAAC AAATACGAAT 150 

AACCCAATTT CTGTTCCAGT GGGAACGAAA ACATTGGGTC GTATCATGAA 2 00 

25 CGTATTGGGT GAACCAATCG ATGAGCAAGG TGAAATCGGT GCAGAAGAGA 2 50 

ATTGGTCTAT TCACCGTGCG CCACCAAGTT ATGAAGAACA ATCTAACAGT 300 

ACTGAACTTT TAGAAACGGG AATTAAAGTT ATCGACTTAG TTTGTCCGTT 3 50 

TGCGAAAGGG GGTAAAGTAG GTTTATTCGG TGGTGCGGGT GTCGGTAAAA 4 00 

CCGTCAATAT GATGGAATTA ATCCGTAACA TCGCAATTGA GCACTCAGGT 4 50 

3 0 TACTCTGTCT TTGCGGGGGT AGGTGAGCGT ACGCGTGAAG GTAACGACTT 500 

CTATCATGAG ATGAAAGACT CTAACGTATT AGATAAAGTG TCTCTTGTTT 550 

ATGGTCAAAT GAACGAGCCA CCAGGTAACC GTTTACGTGT GGCATTAACA 600 

GGCTTAACTA TGGCGGAAAA ATTCCGTGAT GAAGGTCGTG ATGTCTTATT 650 

CTTCGTTGAT AATATTTATC GTTATACTCT TGCTGGTACA GAAGTTTCTG 7 00 

3 5 CATTATTAGG TCGTATGCCA TCTGCGGTAG GTTATCAACC AACCCTTGCA 750 

GAAGAAATGG GTGTTCTGCA AGAGCGTATT ACCTCAACCA AAACAGGTTC 800 

TATTA 8 05 



10 



2) INFORMATION FOR SEQ ID NO: 333 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 82 8 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pragia fontium 

(B) STRAIN: ATCC 49100 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333 

TTTCCTCAAG ACGCCGTACC AAAAGTGTAC GACGCGCTTG AAGTTCAGAA 50 

CGATGCCAAG CTGGTGCTGG AAGTTCAACA ACAGCTCGGT GGTGGTGTCG 100 

TTCGTTGTAT CGCAATGGGT ACTTCCGATG GCTTAAGCCG CGGTTTAAAA 150 

50 GTGCTTGATT TAGAACATCC TATCGAAGTA CCGGTTGGTA CTGCGACGCT 2 00 
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GGGCCGTATT ATGAACGTGC TCGGTCAGCC 
TTGGTGAAGA AGAGCGTTGG GCTATTCACC 
GATTTATCTG GCGCCAATGA ACTGCTGGAA 
CCTGATTTGT CCGTTTGCTA AAGGTGGTAA 
5 CGGGTGTAGG TAAAACCGTA AACATGATGG 
ACTGAGCACT CAGGTTACTC CGTATTCGCC 
TGAAGGTAAT GACTTCTACC ACGAAATGAC 
AAGTATCTCT GGTTTATGGC CAGATGAACG 
CGCGTGGCGT TAACGGGTTT GACCATGGCT 
LO TCGTGACGTT CTGTTATTTA TCGATAACAT 
GTACCGAAGT ATCAGCACTG TTGGGGCGTA 
CAGCCAACGT TAGCAGAAGA GATGGGTGTG 
AACCAAAACC GGTTCAATCA CTTCTGTA 

L5 



PCT/CAOO/01150 

AATCGATATG AAAGGCGATA 25 0 

GTGAAGCACC AAGTTATGAA 3 00 

ACGGGTATCA AGGTTATCGA 3 50 

AGTTGGTCTG TTTGGTGGTG 4 00 

AGCTGATTCG TAACATTGCG 450 

GGTGTAGGGG AACGTACCCG 500 

CGAATCAAAC GTACTGGATA 550 

AGCCACCAGG AAACCGTCTG 60 0 

GAAAAATTCC GTGATGAAGG 650 

TTATCGCTAT ACCTTAGCCG 700 

TGCCATCAGC GGTAGGTTAT 750 

TTGCAGGAAC GTATTACTTC 800 

828 



2) INFORMATION FOR SEQ ID NO: 334 

(i) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334 



GTCCCTAAAG TATACGACGC TCTTGAGGTT ATGAATGGTA AAGAAAAACT 50 

GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTATCGTT CGTTGTATCG 10 0 

35 CAATGGGTAC ATCAGACGGT TTAAGCCGTG GCTTAAAGGT TGAAGATTTA 150 

GGCCACCCAA TTGAAGT AC C AGTAGGTAAA GCGACTTTAG GACGTATCAT 2 00 

GAACGTTCTG GGTACACCTA TTGATATGAA AGGTGAGATT GAAACCGAAG 250 

AGCGTTGGTC AATCCACCGT GAAGCACCAA CTTACGAAGA GTTATCAAAC 3 00 

TCTCAAGAAC TGCTTGAAAC CGGTATCAAA GTTATGGACT TAATCTGTCC 350 

10 ATTTGCTAAA GGTGGTAAAG TCGGTCTGTT CGGTGGTGCG GGTGTTGGTA 400 

AAACAGTTAA CATGATGGAA TTGATCCGTA ATATCGCGAT CGAGCACTCA 450 

GGTTACTCTG TATTTGCTGG TGTTGGTGAG CGTACTCGTG AGGGTAACGA 500 

CTTCTATCAT GAAATGACAG ATTCTAACGT TCTTGACAAA GTATCGTTAG 550 

TTTACGGTCA GATGAATGAG CCACCAGGAA ACCGTCTGCG TGTTGCACTG 600 

15 ACTGGTCTGA CTATGGCTGA GAAATTCCGT GATGAAGGCC GTGACGTACT 650 

GTTATTCGTC GATAACATCT ATCGTTACAC CTTAGCCGGT ACAGAAGTAT 700 

CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTACCA ACCAACATTG 750 

GCTGAAGAGA TGGGTGTTCT GCAAGAGCGT ATCACTTCAA CCAAAACAGG 800 

TTCTATC 807 

50 



2) INFORMATION FOR SEQ ID NO: 33 5 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 811 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

200 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 
5 (B) STRAIN: ATCC 13315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335 

CCCTAAAGTA TACGACGCTC TTGAGGTTAT GAATGGTAAA GAGAAACTGG 50 

L0 TGCTAGAAGT TCAGCAACAG TTAGGCGGTG GTATCGTTCG TTGTATCGCA 100 

ATGGGTACAT CAGACGGTTT AAGCCGTGGC TTAAAAGTTG AAAACTTAGG 150 

CCACCCAATT GAAGTACCAG TAGGTAAAGC AACACTGGGA CGTATCATGA 2 00 

ACGTTCTGGG TACACCTATC GATATGAAAG GTGATATTGC AACTGAAGAA 2 50 

CGTTGGTCTA TTCACCGCGA AGCGCCAACC TATGAAGAGT TATCAAGCTC 3 00 

L5 TCAAGAACTA CTAGAAACCG GTATCAAAGT AATGGACTTA ATCTGTCCGT 3 50 

TTGCTAAAGG TGGTAAAGTA GGTCTCTTCG GTGGTGCGGG TGTTGGTAAA 400 

ACAGTTAACA TGATGGAATT GATCCGTAAC ATCGCGATTG AGCACTCAGG 4 50 

TTATTCTGTA TTTGCAGGTG TTGGTGAGCG TACTCGTGAG GGTAACGACT 50 0 

TCTATCATGA AATGACAGAT TCTAACGTTC TTGACAAAGT ATCGTTAGTT 550 

20 TATGGTCAGA TGAATGAGCC ACCAGGAAAC CGTCTACGTG TAGCACTGAC 60 0 

GGGTTTAACC ATGGCGGAAA AATTCCGTGA TGAAGGCCGT GACGTACTGT 650 

TATTCGTCGA TAACATCTAT CGTTACACCT TAGCCGGTAC CGAAGTATCA 70 0 

GCACTGTTAG GCCGTATGCC ATCAGCAGTA GGTTACCAAC CAACATTGGC 750 

TGAAGAGATG GGTGTTCTGC AAGAACGTAT CACTTCAACC AAAACAGGTT 800 

25 CAATCACCTC T 811 



2) INFORMATION FOR SEQ ID NO: 336 



30 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
£0 (A) ORGANISM: Providencia alcalifaciens 

(B) STRAIN: ATCC 9886 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336 

15 TCAAGATAAC GTACCAAAAG TGTACGATGC TCTTGAGGTT ATTAACGGTA 50 

AAGAAAAACT GGTGTTGGAA GTTCAACAAC AGTTAGGTGG TGGTGTTGTC 100 

CGTTGTATCG CAATGGGTAC ATCAGATGGT CTGAGCCGTG GTTTAGAAGT 150 

TGTAAACTTA GAGCACCCAA TCGAAGTACC AGTCGGTAAA GCAACTCTGG 2 00 

GACGTATCAT GAACGTTCTG GGTGAACCAA TCGACATGAA AGGTGATATC 250 

50 GGCGAAGAAG AGCGCTGGTC TATTCACCGT GCTGCACCAA GCTACGAAGA 300 

ATTAGCTAAC TCAACTGAAC TGCTGGAAAC CGGTATCAAA GTAATGGACT 350 

TAATCTGTCC ATTCGCGAAA GGTGGTAAAG TAGGTCTGTT CGGTGGTGCG 4 00 

GGTGTTGGTA AAACCGTAAA CATGATGGAA CTGATCCGTA ACATCGCGAT 450 

TGAGCACTCA GGTTACTCAG TGTTCGCTGG TGTTGGTGAG CGTACCCGTG 500 

55 AAGGTAACGA CTTCTATCAT GAAATGACAG ACTCAAACGT TCTGGATAAA 550 

GTATCACTGG TTTATGGCCA GATGAACGAG CCACCAGGAA ACCGTCTGCG 60 0 

TGTTGCGCTG ACTGGTCTGA CTATGGCTGA AAAATTCCGT GACGAAGGTC 650 

GTGACGTACT GCTGTTCGTT GACAACATTT ATCGTTATAC ACTGGCAGGT 700 

ACTGAAGTAT CAGCACTGTT AGGTCGTATG CCATCAGCGG TAGGTTACCA 750 

50 ACCAACGCTG GCGGAAGAGA TGGGTGTTCT TCAAGAACGT ATTACCTCAA 800 

201 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
CTCAAA 



PCT/CA00/01150 



5 2) INFORMATION FOR SEQ ID NO: 337 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 
L0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

L5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 

(B) STRAIN: ATCC 9250 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337 



TTCCCTCAAG ATGACGTACC AAAAGTGTAC GACGCTCTTG AGGTTGTTAA 50 

CGGTAAAGAA ACACTGGTGC TGGAAGTTCA GCAACAGTTA GGCGGTGGTG 100 

TTGTCCGTTG TATCGCAATG GGTACATCAG ATGGCCTGAG CCGTGGTTTA 150 

GAAGTTGTAA ACTTAGAGCA CCCAATTGAA GTACCAGTAG GTAAAGCAAC 2 00 

25 TTTAGGACGT ATCATGAACG TTCTGGGTCA GCCTATTGAT ATGAAAGGTG 2 50 

ATATCGGCGA AGAAGAGCGC TGGTCAATTC ACCGTGCTGC ACCTAGCTAC 300 

GAAGAGTTAG CTAACTCAAC AGAGCTGCTG GAAACCGGTA TCAAAGTAAT 350 

GGACTTAATC TGTCCATTCG CGAAAGGTGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGTGT TGGTAAAACA GTAAACATGA TGGAACTGAT CCGTAACATC 450 

30 GCGATTGAGC ACTCAGGTTA CTCAGTATTC GCTGGTGTTG GTGAGCGTAC 500 

TCGTGAAGGG AACGACTTCT ATCATGAAAT GACTGACTCA AACGTTCTGG 550 

ATAAAGTATC ACTGGTTTAT GGC CAGATGA ATGAGCCACC AGGAAACCGT 600 

CTGCGCGTTG CGTTGACTGG TCTGACTATG GCTGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTACTGT TCGTTGACAA CATCTATCGT TATACACTGG 700 

35 CAGGTACTGA AGTATCAGCA CTGTTAGGTC GTATGCCTTC AGCGGTAGGT 750 

TATCAGCCAA CGCTGGCGGA AGAGATGGGT GTTCTGCAAG AACGTATTAC 800 

CTCAACTCAA ACGGGTTCTA TCACTTCCGT 83 0 



10 



50 



2) INFORMATION FOR SEQ ID NO: 338 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 812 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rustigianii 

(B) STRAIN: ATCC 33673 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338 

AGCGTACCAA AAGTGTACGA TGCTCTTGAG GTTATTAACG GTAAAGAAAA 50 

ACTGGTGTTG GAAGTTCAGC AGCAGTTAGG CGGTGGTGTT GTCCGTTGTA 100 

TCGCAATGGG TACATCAGAT GGTCTGAGCC GTGGTTTAGA AGTTGTAAAC 150 

50 TTAGAACACC CAATTGAAGT ACCAGTAGGT AAAGCAACTC TGGGACGTAT 2 00 

202 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01 150 

CATGAACGTT CTGGGTGACC C T ATTG AT AT GAAAGGTGAT ATCGGCGAAG 25 a 

AAGAGCGCTG GTCTATTCAC CGTTCAGCGC CAAGCTATGA AGAATTAGCT 3 00 

AACTCAACAG AACTGCTAGA AACCGGTATC AAAGTAATGG ACTTAATCTG 3 50 

TCCATTCGCG AAAGGTGGTA AAGTTGGTCT GTTCGGTGGT GCGGGTGTTG 400 

5 GTAAAACAGT AAACATGATG GAACTGATCC GTAACATCGC GATTGAGCAC 4 50 

TCAGGTTACT CAGTATTCGC TGGTGTTGGT GAGCGTACCC GTGAAGGTAA 500 

CGACTTCTAT CATGAAATGA CTGATTCTAA CGTTCTGGAT AAAGTATCAC 550 

TGGTTTATGG CCAGATGAAC GAGCCACCAG GAAACCGTCT GCGTGTTGCG 600 

CTGACTGGTC TGACTATGGC TGAAAAATTC CGTGACGAAG GTCGTGACGT 650 

LO ACTGCTGTTC GTTGACAACA TTTATCGTTA TACACTGGCA GGTACTGAAG 700 

TATCAGCACT GTTAGGTCGT ATGCCTTCAG CGGTAGGTTA TCAGCCAACA 750 

TTGGCAGAAG AGATGGGTGT TCTACAAGAA CGTATCACTT CTACCAAAAC 80 0 

CGGTTCTATC AC 812 



15 



30 



2) INFORMATION FOR SEQ ID NO : 33 9 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia. stuartii 

(B) STRAIN: ATCC 33672 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339 



TCAAGATGCA GTACCAAAAG TGTACGATGC GCTTGAGGTT GTTAACGGTA 50 

AAGAAAAACT GGTGCTGGAA GTTCAGCAAC AGTTAGGCGG TGGTGTTGTC 100 

35 CGTTGTATCG CAATGGGTAC ATCAGATGGC CTAAGCCGTG GTTTAGAAGT 150 

TAAAAATTTA GAACACCCAA TTGAAGTACC AGTAGGTAAA GCAACACTCG 2 00 

GACGTATCAT GAACGTTCTG GGTGACCCTA TTGATATGAA AGGTGATATC 2 50 

GGCGAAGAAG AGCGTTGGTC TATTCACCGC GCTGCACCAA GCTACGAAGA 3 00 

GCTATCGAGC TCAACTGAAC TGCTAGAGAC AGGTATCAAA GTCATGGACT 350 

10 TGATCTGTCC ATTCGCGAAA GGTGGTAAAG TTGGTCTGTT CGGTGGTGCG 4 00 

GGTGTTGGTA AAACGGTAAA CATGATGGAA CTTATCCGTA ACATCGCGAT 4 50 

TGAGCACTCA GGTTACTCAG TATTCGCAGG TGTTGGTGAG CGTACCCGTG 500 

AAGGTAACGA CTTCTATCAT GAAATGACAG ATTCAAACGT TCTTGACAAA 550 

GTATCACTGG TTTATGGTCA GATGAATGAG CCACCAGGAA ACCGTCTACG 600 

15 CGTAGCATTG ACTGGTTTGA CTATGGCTGA GAAATTCCGT GACGAAGGCC 650 

GTGATGTTCT GTTGTTCGTG GATAACATCT ATCGTTATAC ACTGGCAGGT 700 

ACAGAAGTAT CGGCTCTGTT AGGTCGTATG CCATCAGCAG TAGGTTATCA 750 

GCCAACATTG GCAGAAGAGA TGGGTGTTCT TCAAGAACGT ATCACTTCTA 800 

CTAAGACAGG TTCTATCAC 819 



2) INFORMATION FOR SEQ ID NO : 34 0 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 

203 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Psychrobacter phenylpyruvicus 
5 (B) STRAIN: ATCC 23333 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340 

AACCGCACTG ACGTGCCTCA AATTTATGAC GCGTTAGTTG TAGATGGTAC 50 

10 CGAAACCACC CTAGAAGTTC AGCAGCAGCT GGGCGATGGT GTGGTACGTA 100 

CTATTGCAAT GGGATCTACT GAAGGTCTTA AGCGTGGTTT ACCAGTAACA 150 

AACACTGGTG CCCCAATTAC AGTTCCAGTA GGTGATGCGA CTTTAGGTCG 2 00 

TATTATGGAC GTTTTAGGTC GTCCAATTGA CGAACAAGGT CCAGTTAATT 2 50 

CTGAAGACCA TTGGTCAATC CACCGTCAAG CGCCATCATA TGATGAGCAA 3 00 

15 GCTAACAGTA CTGACCTATT AGAGACAGGT ATTAAAGTAA TTGACTTACT 3 50 

TTGTCCGTTC GCTAAAGGGG GTAAAGTTGG TCTGTTCGGT GGTGCCGGTG 4 00 

TTGGTAAAAC CGTAAACATG ATGGAATTGA TTAATAACAT CGCTCTTAAG 4 50 

CACTCAGGTT TATCAGTATT CGCTGGTGTG GGTGAGCGTA CTCGTGAAGG 500 

TAACGACTTC TACCACGAGA TGCAAGAAGC GGGTGTTGTT GACGTTGAAA 55 0 

2 0 ACTTCACCAA CTCAAAAGTT GCGATGGTTT ATGGTCAGAT GAATGAGCCA 600 

CCAGGTAACC GTTTACGTGT TGCGTTAACC GGTCTGACTA TGGCTGAGTA 650 

CTTCCGTGAT CAAAAAGATG AAAACGGTAA AGGTAAAGAC GTTCTATTAT 700 

TCGTTGATAA CATCTACCGC TACACGCTAG CCGGTACTGA AGTATCAGCA 750 

CTTCTAGGTC GTATGCCATC AGCAGTAGGT TATCAGCCAA CACTAGCGGA 8 00 

25 AGAGATGGGT GTACTACAAG AGCGTATTAC TTCAACTCAG ACTGGTTCTA 850 

TTACTTC 9 07 



30 2) INFORMATION FOR SEQ ID NO: 341 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



40 



45 



50 



55 



60 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rahnella aquatilis 

(B) STRAIN: DSM 4594 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341 



GTTCCCTCAG 
ACGGTACCTC 
GTTGTTCGTT 
GAAAGTGAAC 
CTCTGGGTCG 
GAAATCGGCG 
TGAAGAGCTG 
TGGACCTGAT 
GGTGCGGGTG 
TGCGATCGAG 
CTCGTGAGGG 
GACAAAGTTT 
TCTGCGCGTT 
AAGGTCGTGA 
GCCGGTACCG 



GATGCAGTAC 
CAAACTGGTG 
GTATCGCAAT 
AACCTGGAAC 
TATCATGAAC 
AAGAAGAACG 
GCAAACTCCC 
GTGTCCGTTC 
TAGGTAAAAC 
CACTCCGGTT 
TAACGACTTC 
CCCTGGTGTA 
GCACTGACCG 
CGTACTGCTG 
AAGTGTCCGC 



CGAACGTGTA CAATGCTCTT 
CTGGAAGTTC AGCAACAGTT 
GGGTACCTCA GACGGCCTGC 
AC C CAATTGA AGTACCGGTA 
GTATTGGGTG AACCAATCGA 
TCGTGCTATT CACCGTGCTG 
AGGAATTGCT GGAAACCGGT 
GCTAAGGGCG GTAAAGTTGG 
TGTGAACATG ATGGAGCTGA 
ATTCTGTGTT TGCAGGCGTG 
TACCACGAAA TGACTGATTC 
TGGCCAGATG AATGAGCCAC 
GCCTGACCAT GGCGGAAAAA 
TTCGTTGACA ACATTTACCG 
ACTTCTGGGC CGTATGCCAT 
204 



GAGGTAGAAA 50 

AGGCGGCGGC 100 

GTCGCGGTCT 150 

GGTAAAGCGA 2 00 

CATGAAAGGT 2 50 

CGCCTTCTTA 3 00 

ATCAAAGTTA 3 50 

TCTGTTCGGT. 4 00 

TCCGTAACAT 4 50 

GGTGAACGTA 500 

CAACGTTATC 550 

CAGGTAACCG 600 

TTCCGTGATG 650 

TTACACCCTG 700 

CGGCGGTAGG 750 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

TTATCAGCCA ACGCTGGCGG AAGAGATGGG CGCTCTGCAA GAACGTATCA* 800 
CCTCGACCAA AAGTGGTTCT ATCACCTCCG TA 832 



2) INFORMATION FOR SEQ ID NO: 342 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 824 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella, choleraesuis subsp. arizonae 

(B) STRAIN: ATCC 13314 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT 150 

25 GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 2 00 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCTATTC ACCGCGCGGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TAAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

30 GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCT AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCACC GGGAAACCGT 600 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

35 AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATACCCTCG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TAYCASCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAA ACCGGTTCTA TCAC 824 



40 



2) INFORMATION FOR SEQ ID NO: 343 



(i) SEQUENCE CHARACTERISTICS : 
45 (A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Choleraesuis 
55 (B) STRAIN: ATCC 7001 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343 

GATGCCGTAC CACGCGTGTA CGATGCCCTT GAGGTGCAGA ATGGTAATGA 50 
60 GAAGCTGGTG CTGGAAGTTC AGCAGCAGCT TGGCGGCGGT ATCGTGCGTA 100 

205 



SUBSTITUTE SHEET (RULE 26) 



10 



15 



WO 01/23604 

CCATCGCGAT 
GATCTCGAAC 
TATCATGAAC 
AAGAAGAGCG 
TCAAACTCTC 
GTGTCCGTTC 
TAGGTAAAAC 
CACTCCGGTT 
TAACGACTTC 
CCCTGGTGTA 
GCACTGACCG 
TGTACTGCTG 
AAGTATCCGC 
ACTCTGGCGG 
AACCGGTTCT 



GGGGTCTTCT 
ACCCGATCGA 
GTCCTGGGCG 
TTGGGCGATT 
AGGAACTGCT 
GCGAAGGGCG 
CGTAAACATG 
ACTCAGTGTT 
TACCACGAAA 
TGGCCAGATG 
GCCTGACCAT 
TTCGTCGATA 
ACTGCTGGGC 
AAGAGATGGG 
ATCACCTCCG 



GACGGTCTGC 
AGTCCCGGTA 
AACCGGTCGA 
CACCGCGCAG 
GGAAACCGGT 
GTAAAGTCGG 
ATGGAGCTTA 
TGCGGGCGTA 
TGACCGACTC 
AACGAGCCGC 
GGCGGAGAAA 
ACATCTATCG 
CGTATGCCTT 
CGTTCTGCAG 



GTCGCGGTCT * 

GGTAAAGCCA 

CATGAAAGGC 

CACCTTCCTA 

ATCAAAGTTA 

TCTGTTCGGT 

TTCGTAACAT 

GGGGAACGTA 

CAACGTTATC 

CGGGAAACCG 

TTCCGTGACG 

TTACACCCTG 

CCGCAGTAGG 

GAACGTATCA 



PCT/CA00/01150 

GGATGTAAA^ 15 0 

CGCTGGGTCG 2 00 

GAGATCGGCG 250 

CGAAGAGTTG 3 00 

TCGACCTGAT 3 50 

GGTGCGGGTG 4 00 

CGCGATCGAG 450 

CTCGTGAGGG 500 

GATAAAGTAT 550 

TCTGCGCGTT 6 00 

AAGGTCGTGA 650 

GCCGGTACGG 70 0 

TTACCAGCCG 750 

CCTCCACCAA 800 

820 



2) INFORMATION FOR SEQ ID NO: 344 



20 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) . TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Salmonella, choleraesuis subsp. diarizonae 

(B) STRAIN: ATCC 43973 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344 

35 TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 10 0 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTT 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 2 00 

GCTGGGTCGT ATCATGAACG TCCTGGGTGA ACCGGTCGAT ATGAAAGGCG 250 

40 AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 4 50 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

45 TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCT AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCACC GGGAAAC CGT 60 0 

CTGCGCGTTG CATTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 65 0 

AGGTCGTGAC GTTCTGCTGT TCGTCGACAA CATCTACCGT TATACCCTCG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

50 TATCAGCCGA CTCTGGCTGA AGAAATGGGC GTTCTGCAGG AACGTATCAC 8 00 

CTCCACCAAA ACCGGTTCTA TCACCTCCGT A 831 



55 2) INFORMATION FOR SEQ ID NO: 345 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

206 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella, choleraesuis subsp. houtenae 

(B) STRAIN: ATCC 43974 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345 

10 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCTCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 10 0 

TCGTACGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 200 

15 GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCTGC GCCGTCCTAC 3 00 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

20 GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGTCAGATGA ACGAGCCGCC GGGAAACCGT 60 0 

CTGCGCGTTG CATTGAC CGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 70 0 

25 CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TATCAGCCGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAG ACCGGTTCTA TCACCTCCGT A 831 



3 0 



40 



2) INFORMATION FOR SEQ ID NO: 346 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 9 bases 
35 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Doubl e 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. indica 

(B) STRAIN: ATCC 43974 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTCGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

50 GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCCAC 2 00 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAT ATGAAAGGCG 2 50 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCGTCCTAT 3 00 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

55 GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAGT TCCGTGACGA 650 

60 AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700 
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CCGGTACGGA AGTTTCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT * 75t> 

TATCAGCCAA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 8 00 

CTCCACCAAA ACCGGTTCTA TCACCTCCG 829 



2) INFORMATION FOR SEQ ID NO: 347 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi A 
20 (B) STRAIN: ATCC 9150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

25 GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCTACGC TGGGTCGTAT 2 00 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 2 50 

AAGAGCGTTG GGCGATTCAC CGCGCAGCGC CTTCCTACGA AGAGTTGTCA 3 00 

30 AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 3 50 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACCGT AAACATGATG GAGCTTATCC GTAACATCGC GATCGAGCAC 4 50 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGT GAACGTACTC GTGAGGGTAA 500 

CGACTTCTAC CATGAAATGA CCGACTCCAA CGTTATCGAT AAAGTATCCC 550 

35 TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 60 0 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 70 0 

TATCCGCACT GCTGGGTCGT ATGCCTTCCG CGGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAGAC 800 

4 0 CGGTTCTATC ACCTCCG 817 



45 



60 



2) INFORMATION FOR SEQ ID NO : 34 8 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype Paratyphi B 
(B) STRAIN: ATCC 8759 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348 
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GCCGTACCAC GCGTGTACGA 
GCTGGTGCTG GAAGTTCAGC 
TCGCGATGGG GTCTTCTGAC 
CTCGAACACC CGATCGAAGT 
5 CATGAACGTC CTGGGCGAAC 
AAGAGCGTTG GGCGATTCAC 
AACTCTCAGG AACTGCTGGA 
TCCGTTCGCG AAGGGCGGTA 
GTAAAACCGT AAACATGATG 

10 TCCGGTTACT CTGTGTTTGC 
CGACTTCTAC CACGAAATGA 
TGGTGTATGG CCAGATGAAC 
TTGACCGGTC TGACCATGGC 
ACTGCTGTTC GTCGATAACA 

15 TATCCGCACT GCTGGGCCGT 
CTGGCGGAAG AGATGGGCGT 
CGGTTC 



TGCCCTTGAG GTGCAGAATG 
AGCAGCTTGG CGGCGGTATT 
GGTCTGCGTC GCGGTCTGGA 
CCCGGTAGGT AAAGCCACGC 
CGGTCGACAT GAAAGGCGAG 
CGCGCAGCGC CTTCCTACGA 
AACCGGTATC AAAGTTATCG 
AAGTCGGTCT GTTCGGTGGT 
GAGCTTATCC GTAACATCGC 
GGGCGTAGGT GAACGTACTC 
CCGACTCCAA CGTTATCGAT 
GAGCCGCCGG GAAACCGTCT 
GGAGAAATTC CGTGACGAAG 
TCTATCGTTA CACCCTGGCC 
ATGCCTTCCG CAGTAGGTTA 
TCTGCAGGAA CGTATCACCT 



PCT/CA00/01150 



GTAATGAGAA 


£o 


GTGCGTACCA 


100 


TGTAAAAGAT 


150 


TGGGTCGTAT 


200 


ATCGGCGAAG 


250 


AGAGTTGTCA 


300 


ACCTGATGTG 


350 


GCGGGTGTGG 


400 


GATCGAGCAC 


450 


GTGAGGGTAA 


500 


AAAGTATCCC 


550 


GCGCGTTGCA 


600 


GTCGTGACGT 


650 


GGTACGGAAG 


700 


CCAGCCGACT 


750 


CCACCAAAAC 


800 




806 



20 

2) INFORMATION FOR SEQ ID NO: 34 9 

( i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 831 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. salamae 

(B) STRAIN: ATCC 43972 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGKGGTCTG 150 

40 GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 200 

GCTGGGCCGT ATCATGAACG TACTGGGCGA ACCGGTCGAC ATGAAAGGCG 2 50 

AGATTGGTGA AGAAGAGCGT TGGGCTATTC ACCGTGCTGC GCCGTCCTAC 300 

GAAGAGTTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 350 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 400 

45 GCGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT CCGTAACATC 450 

GCGATCGAGC ACTCCGGTTA CTCCGTGTTT GCGGGCGTAG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGACGA 650 

50 AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TATACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TATCAGCCGA CGCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACTAAA ACCGGTTCTA TCACCTCCGT A 831 

55 

2 ) INFORMATION FOR SEQ ID NO: 350 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 823 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhi 
<B) STRAIN: ATCC 1074 9 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 50 

TTCCCTCAGG ATGCCGTACC ACGCGTGTAC GATGCCCTTG AGGTGCAGAA 50 

TGGTAATGAG AAGCTGGTGC TGGAAGTTCA GCAGCAGCTT GGCGGCGGTA 100 

TCGTGCGTAC CATCGCGATG GGGTCTTCTG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ATCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCTAC 200 

GCTGGGTCGT ATCATGAACG TCCTGGGCGA ACCGGTCGAC ATGAAAGGCG 250 

AGATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC GCCTTCCTAC 3 00 

GAAGAGTTAT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTTAT CCGTAACATC 4 50 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 50 0 

TCGTGAGGGT AACGACTTCT ACCATGAAAT GACCGACTCC AACGTTATCG 550 

ATAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAGAAAT TCCGTGACGA 650 

AGGTCGTGAC GTACTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGTC GTATGCCTTC CGCGGTAGGT 750 

TACCAGCCGA CTCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 800 

CTCCACCAAG ACCGGTTCTA TCA 82 3 



2) INFORMATION FOR SEQ ID NO: 3 51 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 

(B) STRAIN: ATCC 14 028 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351 



TTCCCTCAGG ATGCCGTACC 
TGGTAATGAG AAGCTGGTGC 
TTGTGCGTAC CATCGCGATG 
GATGTAAAAG ATCTCGAACA 
GCTGGGTCGT ATCATGAACG 
AGATCGGCGA AGAAGAGCGT 
GAAGAGTTGT CAAACTCTCA 
CGACCTGATG TGTCCGTTCG 
GTGCGGGTGT AGGTAAAACC 
GCGATCGAGC ACTCCGGTTA 



ACGCGTGTAC GATGCCCTTG 
TGGAAGTTCA GCAGCAGCTT 
GGGTCTTCTG ACGGTCTGCG 
CCCGATCGAA GTCCCGGTAG 
TCCTGGGCGA ACCGGTCGAC 
TGGGCGATTC ACCGCGCAGC 
GGAACTGCTG GAAACCGGTA 
CGAAGGGCGG TAAAGTCGGT 
GTAAACATGA TGGAGCTTAT 
CTCAGTGTTT GCGGGCGTAG 
210 



AGGTGCAGAA 


50 


GGCGGCGGTA 


100 


TCGCGGTCTG 


150 


GTAAAGCTAC 


200 


ATGAAAGGCG 


250 


GCCTTCCTAC 


300 


TCAAAGTTAT 


350 


CTGTTCGGTG 


400 


CCGTAACATC 


450 


GGGAACGTAC 


500 
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TCGTGAGGGT AACGACTTCT ACCACGAAAT 
ATAAAGTATC CCTGGTGTAT GGCCAGATGA 
CTGCGCGTTG CATTGACCGG TCTGACCATG 
AGGTCGTGAC GTACTGCTGT TCGTCGATAA 
CCGGTACGGA AGTATCCGCA CTGCTGGGCC 
TACCAGCCGA CTCTGGCGGA AGAGATGGGC 
CTCCACCAAA ACCGGTTCTA TCACCTCCGT 



GACCGACTCC ^AGGTTATCS 1 
ACGAGCCGCC GGGAAAC CGT 
GCGGAGAAAT TCCGTGACGA 
CATCTATCGT TACACCCTGG 
GTATGCCTTC CGCAGTAGGT 
GTTCTGCAGG AACGTATCAC 
A 



600 
650 
700 
750 
800 
831 



2) INFORMATION FOR SEQ ID NO: 352 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp . choleraesuis 
serotype Virchow 

(B) STRAIN: ATCC 51955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352 

GCCGTACCAC GCGTGTACGA TGCCCTTGAG GTGCAGAATG GTAATGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AGCAGCTTGG CGGCGGTATC GTGCGTACCA 100 

TCGCGATGGG GTCTTCTGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAT 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCCACGC TGGGTCGTAT 2 00 

CATGAACGTC CTGGGCGAAC CGGTCGACAT GAAAGGCGAG ATCGGCGAAG 250 

AAGAGCGTTG GGCGATTCAC CGCGCAGCAC CTTCCTACGA AGAGTTGTCA 3 00 

AACTCTCAGG AACTGCTGGA AACCGGTATC AAAGTTATCG ACCTGATGTG 350 

TCCGTTCGCG AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 400 

GTAAAACCGT AAACATGATG GAGCTTATTC GTAACATCGC GATCGAGCAC 450 

TCCGGTTACT CTGTGTTTGC GGGCGTAGGG GAACGTACTC GTGAGGGTAA 50 0 

CGACTTCTAC CACGAAATGA CCGACTCCAA CGTTATCGAT AAAGTCTCCC 550 

TGGTGTATGG CCAGATGAAC GAGCCGCCGG GAAACCGTCT GCGCGTTGCA 600 

TTGACCGGTC TGACCATGGC GGAGAAATTC CGTGACGAAG GTCGTGACGT 650 

ACTGCTGTTC GTCGATAACA TCTATCGTTA CACCCTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGCCGT ATGCCTTCCG CAGTAGGTTA CCAGCCGACT 750 

CTGGCGGAAG AGATGGGCGT TCTGCAGGAA CGTATCACCT CCACCAAAAC 800 

CGGTTCTATC 810 



2) INFORMATION FOR SEQ ID NO: 353 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia ficaria 

(B) STRAIN: ATCC 33105 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353 



GATGCCGTAC 
5 GAAACTGGTG 
GTATCGCAAT 
AACCTGGAAC 
TATCATGAAC 
AAGAAGAGCG 

10 TCCAACTCCC 
TTGTCCGTTC 
TGGGCAAAAC 
CACTCCGGTT 
TAACGACTTC 

15 CCCTGGTTTA 
GCATTGACCG 
CGTTCTGCTG 
AAGTGTCCGC 
ACGCTGGCGG 

2 0 GACCGGTTCC 



CAAAAGTGTA 
CTGGAAGTTC 
GGGGACCTCT 
ACCCGATTGA 
GTATTGGGCG 
TTGGGCGATT 
AGGACCTGCT 
GCCAAGGGCG 
CGTAAACATG 
ATTCCGTGTT 
TACCACGAAA 
CGGCCAGATG 
GCCTGACCAT 
TTCGTTGACA 
ACTTCTGGGC 
AAGAGATGGG 
ATCACCTCCG 



CGATGCCCTT 
AGCAGCAGCT 
GACGGTCTGC 
AGTGCCGGTG 
AACCAATCGA 
CACCGTCCTG 
GGAAACCGGT 
GTAAAGTCGG 
ATGGAGCTGA 
TGCGGGCGTG 
TGAACGACTC 
AACGAGCCGC 
GGCGGAGAAA 
ACATTTACCG 
CGTATGCCAT 
CGTTCTGCAA 



GAGGTAGAAA 
GGGCGGTGGC 
GTCGCGGTCT 
GGTAAAGCTA 
CATGAAAGGC 
CGCCAAGCTA 
ATCAAGGTAA 
TCTGTTCGGT 
TCCGTAACAT 
GGCGAGCGTA 
CAACGTTCTG 
CGGGTAACCG 
TTCCGTGACG 
TTACACCCTG 
CCGCGGTAGG 
GAACGTATCA 



ACGGTACCGA 50 

GTCGTTCGCT 100 

GAAAGTGAAC 150 

CCCTGGGCCG 200 

GAGATCGGCG 2 50 

CGAAGAGCTG 3 00 

TGGACCTGAT 350 

GGTGCGGGCG 4 00 

CGCGATCGAG 450 

CTCGTGAGGG 500 

GACAAAGTAT 550 

TCTGCGCGTT 600 

AAGGCCGCGA 650 

GCCGGTACCG 700 

TTATCAGCCA 750 

CCTCGACCAA 8 00 

820 



25 



2) INFORMATION FOR SEQ ID NO: 3 54 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

Cvi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Serratia fonticola 

(B) STRAIN: ATCC 29844 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354 

40 GATGCCGTAC CGAAAGTGTA CAACGCTCTT GAGGTTGCAA ACGGCACCGA 50 

GAAATTGGTG CTGGAAGTTC AGCAACAGCT GGGTGGCGGC GTGGTTCGCT 10 0 

GTATCGCAAT GGGGACCTCA GACGGTCTGC GTCGTGGTCT GGCCGTAACC 150 

GACCTGCAGC AC C C AATTGA AGTACCGGTA GGTAAAGCTA CCCTGGGCCG 2 00 

TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAGGGC GACATCGGCG 2 50 

45 AAGAAGAACG TTGGGCTATT CACCGCCCTG CGCCAAGCTA CGAAGAGCTG 300 

TCCAGCTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 3 50 

CTGCCCGTTC GCCAAGGGTG GTAAAGTTGG TCTGTTCGGT GGTGCTGGTG 4 00 

TAGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ATTCTGTGTT TGCGGGCGTG GGTGAACGTA CTCGTGAGGG 500 

50 TAACGACTTC TACCACGAAA TGACCGATTC CAACGTACTG GACAAAGTTT 550 

CCCTGGTTTA CGGCCAGATG AACGAGCCAC CAGGTAACCG TCTGCGCGTT 600 

GCGCTGACCG GCCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTCGTGA 650 

CGTACTGCTG TTCGTCGATA ACATCTACCG TTATACCCTG GCCGGTACCG 70 0 

AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 75 0 

55 ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 800 

GACTGGTTCA ATCACC 816 



60 2) INFORMATION FOR SEQ ID NO: 355 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia grimes ii 

(B) STRAIN: ATCC 14460 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355 



GATGCCGTAC CAAAAGTGTA CAACGCTCTT GAGGTAGAAA ACGGTGCCAA 50 

TAAGCTGGTG CTGGAAGTTC AGCAACAGTT GGGCGGTGGC GTGGTTCGCT 100 

GTATCGCAAT GGGGACCTCT GATGGTCTGC GTCGCGGTCT GAAAGTCACA 15 0 

GACCTGGACC AC C C AATTGA AGTACCGGTA GGTAAAGCTA CTCTGGGCCG 200 

2 0 TATCATGAAC GTATTGGGTG AACCAATCGA CATGAAGGGC GATATCGGCG 250 

AAGAAGAACG TTGGGCGATT CACCGTCCGG CGCCAAGCTA CGAAGATTTG 3 00 

GCCAACTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTAA TGGACCTGAT 3 50 

CTGCCCGTTC GCCAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 4 00 

TTGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 4 50 

2 5 CACTCCGGTT ATTCTGTGTT TGCGGGCGTG GGTGAGCGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGAACGACTC CAACGTACTG GACAAAGTAT 550 

CCCTGGTTTA CGGCCAGATG AACGAGCCAC CGGGTAACCG TCTGCGCGTT 600 

GCTCTGACCG GTCTGACCAT GGCTGAGAAA TTCCGTGACG AAGGCCGTGA 65 0 

CGTTCTGCTG TTCGTTGATA ACATCTACCG TTATACCCTG GCCGGTACCG 70 0 

30 AAGTGTCCGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 7 50 

ACGCTGGCGG AAGAGATGGG TGTTCTGCAA GAACGTATCA CCTCTACCAA 800 

GACTGGTTCA ATCACCTCCG TA 822 



35 



45 



2) INFORMATION FOR SEQ ID NO: 356 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 819 bases 
4 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia liquefaciens 

(B) STRAIN: ATCC 27592 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356 

ATGCCGTACC AAAAGTGTAC AATGCTCTTG AGGTAGAAAA CGGTACCGAG 50 

AAGCTGGTGC TGGAAGTTCA GCAGCAGTTG GGCGGTGGCG TAGTTCGCTG 100 

TATCGCGATG GGGACCTCAG ATGGTCTGCG CCGCGGTCTG AAAGTGAACG 150 

55 ATCTGGAACA CCCAATTGAA GTACCGGTAG GTAAAGCTAC CCTGGGCCGT 2 00 

ATCATGAACG TATTGGGTGA ACCAATCGAC ATGAAAGGCG ACATCGGCGA 2 50 

AGAAGAACGT TGGGCGATTC ACCGTCCAGC GCCAAGCTAC GAAGATTTGT 3 00 

CAAACTCCCA GGATCTGCTG GAAACCGGTA TCAAGGTTAT GGACCTGATT 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCTGGTGT 4 00 

60 TGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC GCGATCGAGC 450 
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ACTCCGGTTA TTCCGTGTTT GCAGGCGTGG GTGAGCGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GAACGACTCC AACGTACTGG ACAAAGTATC 550 

CCTGGTTTAC GGCCAGATGA ACGAGCCACC GGGTAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCGGAGAAAT TCCGTGACGA AGGCCGCGAC 650 

GTTCTGCTGT TCGTTGATAA CATTTACCGT TAT AC C CTGG CCGGTACCGA 700 

AGTGTCCGCA CTTCTGGGCC GTATGCCATC TGCGGTAGGT TATCAGCCAA 750 

CGCTGGCGGA AGAGATGGGC GTTCTGCAAG AACGTATCAC CTCTACCAAG 800 

ACCGGTTCTA TCACTTCCG 819 



2) INFORMATION FOR SEQ ID NO: 357 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 

(B) STRAIN: ATCC 13880 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357 



TCAGGATGCC GTACCGAAAG TGTACGACGC CCTTGAGGTA GAAAACGGCA 50 

CCGAAAAACT GGTGTTGGAA GTTCAGCAAC AGCTGGGCGG TGGCGTGGTT 100 

3 0 CGCTGTATCG CAATGGGGAC CTCCGACGGT CTGCGTCGCG GTCTGAAAGT 150 

GAACAACCTG GACCACCCGA TTGAAGTGCC GGTGGGTAAA GCTACCCTGG 2 00 

GTCGTATCAT GAACGTATTG GGTCAACCGA TCGACATGAA AGGCGACATC 2 50 

GGCGAAGAAG AGCGTTGGGC GATTCACCGC GCGGCGCCAA GCTACGAAGA 3 00 

GCTGTCAAGC TCTCAGGAAC TGCTGGAAAC CGGTATCAAG GTAATGGACC 3 50 

35 TGATTTGTCC GTTCGCCAAG GGCGGTAAAG TCGGTCTGTT CGGCGGTGCG 4 00 

GGCGTAGGTA AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT 4 50 

CGAGCACTCC GGTTATTCCG TGTTTGCGGG CGTGGGCGAG CGTACTCGTG 500 

AGGGTAACGA CTTCTACCAC GAAATGACCG ACTCCAACGT TCTGGACAAA 550 

GTATCCCTGG TTTACGGCCA GATGAACGAG CCACCAGGTA ACCGTCTGCG 600 

4 0 CGTTGCGCTG ACCGGTCTGA CCATGGCGGA GAAATTCCGT GACGAAGGCC 650 

GTGACGTTCT GCTGTTCGTT GACAACATCT ACCGTTACAC CCTGGCCGGT 700 

ACCGAAGTGT CCGCACTTCT GGGCCGTATG CCATCCGCGG TAGGTTATCA 750 

GCCAACGCTG GCGGAAGAGA TGGGCGTTCT GCAAGAACGT ATCACCTCGA 8 00 

CCAAG 805 

45 

2) INFORMATION FOR SEQ ID NO: 3 58 

50 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Serratia odor if era 
60 (B) STRAIN: ATCC 33077 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358 

GATGCCGTAC CAAAAGTGTA CGATGCCCTT GAGGTAGAAA ACGGTACCGA 50 

5 AACTCTGGTG CTGGAAGTTC AGCAGCAGCT GGGCGGTGGC GTGGTTCGTT 100 

GTATCGCTAT GGGCACCTCC GACGGTTTGC GTCGCGGCCT GAAAGTGAAC 150 

GATCTGCAAC ACCCAATCGA AGTCCCGGTT GGCAAGGCAA CGCTGGGTCG 2 00 

TATCATGAAC GTATTGGGTC AACCAATCGA CATGAAAGGC GACATCGGCG 2 50 

AAGAAGAGCG TTGGGCGATT CACCGCGCGG CGCCAAGCTA CGAAGAACTG 300 

10 TCCAACTCCC AGGATCTGCT GGAAACCGGT ATCAAGGTTA TGGACCTGAT 3 50 

CTGCCCGTTT GCCAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 4 00 

TTGGTAAAAC CGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 4 50 

CACTCCGGTT ATTCAGTGTT TGCGGGCGTG GGTGAGCGTA CTCGTGAGGG 500 

TAACGACTTC TACCACGAAA TGACCGACTC CAACGTACTG GACAAGGTTT 550 

15 CCCTGGTTTA CGGCCAGATG AACGAGCCAC CGGGTAACCG TCTGCGCGTT 60 0 

GCGCTGACCG GTCTGACCAT GGCCGAGAAA TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTTGACA ACATTTACCG TTACACCCTG GCCGGTACCG 7 00 

AAGTGTCTGC ACTTCTGGGC CGTATGCCAT CGGCGGTAGG TTATCAGCCA 7 50 

ACGCTGGCGG AAGAGATGGG CGTTCTGCAA GAACGTATCA CCTCGACCAA 800 

2 0 GACCGGTTCT ATCACCTCCG TA 822 



2) INFORMATION FOR SEQ ID NO: 359 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 05 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Serratiei plymuthica 

(B) STRAIN: ATCC 183 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359 

4 0 GTGTACAACG CTCTTGAGGT AGAAAACGGT GCCAATAAGC TGGTGCTGGA 50 

AGTTCAGCAA CAGCTGGGCG GTGGCGTGGT TCGCTGTATC GCGATGGGGA 100 

CCTCTGATGG TCTGCGTCGC GGTCTGAAAG TGATCGACCT GGATCACCCG 150 

ATTGAAGTAC CGGTAGGTAA AGCTACCCTG GGCCGTATCA TGAACGTATT 2 00 

GGGTGAAC C A ATCGACATGA AAGGCGACAT CGGCGAAGAA GAACGTTGGG 250 

4 5 CAATTCACCG TCCAGCGCCA AGCTACGAAG ATTTGGCCAA CTCCCAGGAT 3 00 

CTGCTGGAAA CCGGTATCAA GGTTATGGAC CTGATCTGTC CGTTCGCTAA 3 50 

GGGCGGTAAA GTGGGTCTGT TCGGCGGTGC GGGCGTGGGT AAAACCGTAA 4 00 

ACATGATGGA GCTGATCCGT AACATCGCGA TCGAACACTC CGGTTATTCC 4 50 

GTGTTTGCGG GCGTGGGTGA GCGTACTCGT GAGGGTAACG ACTTCTACCA 500 

5 0 CGAAATGAAC GACTCCAACG TACTGGACAA AGTATCCCTG GTTTACGGCC 550 

AGATGAACGA GCCACCGGGT AACCGTCTGC GCGTTGCTCT GACCGGTCTG 600 

AC C ATGGCGG AGAAATTCCG TGACGAAGGC CGCGACGTTC TGCTGTTCGT 650 

TGATAACATC TACCGTTATA CCCTGGCCGG TACCGAAGTG TCCGCACTTC 7 00 

TGGGCCGTAT GCCATCTGCG GTAGGTTATC AGCCAACGCT GGCGGAAGAG 750 

55 ATGGGCGTTC TGCAAGAACG TATCACCTCT ACCAAGACCG GTTCTATCAC 8 00 

CTCCG 8 05 



60 2) INFORMATION FOR SEQ ID NO: 360 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia rubidaea 

(B) STRAIN: ATCC 27593 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360 



TTCCCTCAGG ATGCCGTACC AAAAGTGTAC GATGCCCTTG AGGTAGAGAA 50 

CGGTAACGAA AAACTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGCGGCG 100 

TTGTACGCTG TATCGCCATG GGTACTTCCG ACGGTCTGCG TCGCGGTCTG 15 0 

AAAGTTAACG ACCTCGAGCA CCCAATCGAA GTGCCAGTTG GTAAAGCAAC 200 

2 0 GCTGGGTCGT ATCATGAACG TATTGGGTCA GCCAATCGAC ATGAAAGGCG 250 

ACATCGGCGA AGAAGAGCGT TGGGCGATTC ACCGCGCGGC GCCAAGCTAC 3 00 

GAAGAGCTGT CCAGCTCCCA AGAGCTGCTG GAAACCGGTA TCAAGGTAAT 350 

GGACCTGATC TGCCCGTTCG CCAAGGGTGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGCGT AGGTAAAACC GTAAACATGA TGGAGCTGAT CCGTAACATC 450 

25 GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAGCGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTACTGG 550 

ACAAAGTATC CCTGGTTTAC GGCCAGATGA ACGAGCCGCC GGGTAACCGT 600 

CTGCGCGTTG CACTGACCGG CCTGACCATG GCGGAAAAAT TCCGTGATGA 650 

AGGCCGCGAC GTTCTGCTGT TCGTGGATAA CATCTACCGT TACACCCTGG 700 

3 0 CCGGTACCGA AGTGTCCGCA CTGCTCGGCC GTATGCCATC TGCGGTAGGT 750 

TATCAGCCAA CGCTGGCGGA AGAGATGGGC GTTCTGCAAG AACGTATCAC 800 

CTCGACCAAG ACCGGTTCAA TCACCTCCGT A 831 



35 



2) INFORMATION FOR SEQ ID NO: 361 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 406 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas putida 

(B) STRAIN: LCDC D7172 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361 

CCGTAAACAT GATGGAACTG ATCCGTAACA TCGCCATCGA GCACAGCGGT 50 

TATTCCGTGT TCGCCGGTGT GGGTGAGCGT ACTCGTGAGG GTAACGACTT 100 

CTACCACGAG ATGAAGGACT CCAACGTTCT CGACAAGGTA GCGCTGGTCT 150 

55 ACGGTCAGAT GAACGAGCCA CCAGGAAACC GTCTGCGCGT AGCGCTGACC 2 00 

GGCCTGACCA TGGCCGAGAA GTTCCGTGAC GAAGGTAACG ACGTTCTGCT 250 

GTTCGTCGAC AACATCTATC GTTACACCCT GGCCGGTACC GAAGTATCCG 3 00 

CACTGCTGGG CCGTATGCCT TCGGCGGTAG GTTACCAGCC GACCCTGGCT 3 50 

GAAGAGATGG GCGTTCTGCA AGAACGTATC ACTTCGACCA AGGAAGGTTC 4 00 

60 GATCAC 406 
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2) INFORMATION FOR SEQ ID NO: 362 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella boydii 

(B) STRAIN: ATCC 9207 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362 

TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA 50 

TGGTAATGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA 10 0 

TCGTGCGTAC CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG 150 

GATGTAAAAG ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC 2 00 

TCTGGGCCGT ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG 2 50 

AGATCGGTGA AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC 3 00 

GAAGAGCTGT CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG 4 00 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC 4 50 

GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC 5 00 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG 550 

ACAAAGTATC CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG 700 

CCGGTACGGA AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 7 50 

TATCAGCCGA CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC 8 00 

CTCCACCAAA ACTGGTTCTA TCACCTCCGT A 831 



2) INFORMATION FOR SEQ ID NO: 363 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella dysenteriae 

(B) STRAIN: ATCC 11835 

(xi ) SEQUENCE DESCRIPTION: SEQ ID NO: 363 

GCCGTACCGC GCGTGTACGA TGCTCTTGCG GTGCAAAATG GTAATGAGCG 50 

TCTGGTGCTG GAAGTTCAGC AGCAGCTCGG CGGCGGTATC GTGCGTACCA 100 

TCGCAATGGG TTCCTCCGAC GGTCTGCGTC GCGGTCTGGA TGTAAAAGAC 150 

CTCGAACACC CGATCGAAGT CCCGGTAGGT AAAGCGACTC TGGGCCGTAT 2 00 

CATGAACGTA CTGGGTGAAC CGGTCGACAT GAAAGGCGAG ATCGGTGAAG 2 50 
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AAGAGCGTTG GGCTATTCAC 
AACTCTCAGG AACTGCTGGA 
TCCGTTCGCT AAGGGCGGTA 
GTAAAACCGT AAACATGATG 
TCCGGTTACT CTGTGTTTGC 
CGACTTCTAC CACGAAATGA 
TGGTGTATGG CCAGATGAAC 
CTGACCGGTC TGACCATGGC 
TCTGCTGTTC GTTGACAACA 
TATCCGCACT GCTGGGCCGT 
CTGGCGGAAG AGATGGGCGT 
CG 



CGCGCAGCAC CTTCCTACGA 
AACCGGTATC AAAGTTATCG 
AAGTTGGTCT GTTCGGTGGT 
GAGCTCATTC GTAACATCGC 
GGGCGTAGGT GAACGTACTC 
CCGACTCCAA CGTTATCGAC 
GAGCCGCCGG GAAACCGTCT 
TGAGAAATTC CGTGACGAAG 
TCTATCGTTA CACCCTGGCC 
ATGCCTTCAG CGGTAGGTTA 
TCTGCAGGAA CGTATCACCT 
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AGAGCTGTCA 3 00 

ACCTGATGTG 350 

GCGGGTGTAG 4 00 

GATCGAGCAC 4 50 

GTGAGGGTAA 5 00 

AAAGTATCCC 550 

GCGCGTTGCT 600 

GTCGTGACGT 6 50 

GGTACGGAAG 700 

TCAGCCGACC 750 

CCACCAAAAC 800 

802 



2) INFORMATION FOR SEQ ID NO: 3 64 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexneri 

(B) STRAIN: ATCC 12 022 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364 

ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 2 00 

ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 2 50 

AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 3 00 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 3 50 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 4 00 

AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC GCGATCGAGC 4 50 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG CCGGTACGGA 700 

AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 80 0 

ACTGGTTCTA TCACCTCCG 819 



2) INFORMATION FOR SEQ ID NO: 365 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella sonnei 

(B) STRAIN: ATCC 2 993 0 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365 

GTACCGCGCG TGTACGATGC TCTTGAGGTG CAAAATGGTA ATGAGCGTCT 50 

GGTGCTGGAA GTTCAGCAGC AGCTCGGCGG CGGTATCGTG CGTACCATCG 100 

10 CAATGGGTTC CTCCGACGGT CTGCGTCGCG GTCTGGATGT AAAAGACCTC 150 

GAACACCCGA TCGAAGTCCC GGTAGGTAAA GCGACTCTGG GCCGTATCAT 2 00 

GAACGTACTG GGTGAACCGG TCGACATGAA AGGCGAGATC GGTGAAGAAG 2 50 

AGCGTTGGGC GATTCACCGC GCAGCACCTT CCTACGAAGA GCTGTCAAAC 3 00 

TCTCAGGAAC TGCTGGAAAC CGGTATCAAA GTTATCGACC TGATGTGTCC 350 

15 GTTCGCTAAG GGCGGTAAAG TTGGTCTGTT CGGTGGTGCG GGTGTAGGTA 4 00 

AAACCGTAAA CATGATGGAG CTCATTCGTA ACATCGCGAT CGAGCACTCC 450 

GGTTACTCTG TGTTTGCGGG CGTAGGTGAA CGTACTCGTG AGGGTAACGA 500 

CTTCTACCAC GAAATGACCG ACTCCAACGT TATCGACAAA GTATCCCTGG 550 

TGTATGGCCA GATGAACGAG CCGCCGGGAA ACCGTCTGCG CGTTGCTCTG 600 

2 0 ACCGGTCTGA CCATGGCTGA GAAATTCCGT GACGAAGGTC GTGACGTTCT 650 
GCTGTTCGTT GACAACATCT ATCGTTACAC CCTGGCCGGT ACGGAAGTAT * 7 00 

CCGCACTGCT GGGCCGTATG CCTTCAGCGG TAGGTTATCA GCCGACCCTG 750 

GCGGAAGAGA TGGGCGTTCT GCAGGAACGT ATCACCTCCA CCAAAACTGG 800 

TT 802 

25 



2) INFORMATION FOR SEQ ID NO : 366 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366 

TAACGCCTTG GTTATTGATG TGCCTAAAGA AGAAGGTACA ATACAACTAA 50 

CATTAGAAGT TGCGCTGCAA TTAGGTGACG ACGTTGTTCG TACAATTGCG 100 

45 ATGGATTCAA CTGATGGTGT CCAAAGAGGC ATGGATGTAA AAGATACAGG 150 

CAAAGAAATT AGTGTACCTG TTGGTGATGA AACATTAGGT CGTGTATTTA 2 00 

ATGTACTAGG TGAAACAATT GACCTTAAAG AAGAAATTAG TGATTCTGTT 2 50 

CGCCGCGATC CTATCCATCG TCAAGCACCA GCATTCGATG AACTTTCAAC 3 00 

AGAAGTTCAA ATTTTAGAAA CAGGTATTAA AGTAGTAGAT TTACTAGCAC 350 

50 CTTATATTAA AGGTGGTAAA ATCGGATTGT TCGGTGGTGC CGGTGTAGGT 4 00 

AAAACAGTAT TAATCCAAGA ATTAATTAAC AACATCGCTC AAGAGCACGG 4 50 

TGGTATTTCT GTATTCGCCG GTGTAGGTGA ACGTACTCGT GAAGGTAACG 500 

ATTTATACTT CGAAATGAGT GATAGTGGTG TAATTAAGAA AACAGCCATG 550 

GTATTCGGGC AAATGAATGA GCCACCTGGT GCACGTATGC GTGTTGCATT 600 

55 ATCTGGTTTA ACAATGGCTG AATATTTCCG TGACGAACAA GGTCAAGACG 650 

TATTATTATT CATCGATAAC ATTTTCAGAT TTACACAAGC TGGTTCTGAG 700 

GTATCTGCAT TATTAGGTCG TATGCCTTCT GCAGTAGGTT ACCAACCAAC 750 

ACTTGCTACT GAAATGGGAC AATTACAAGA ACGTA 785 

60 
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2) INFORMATION FOR SEQ ID NO: 367 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Staphylococcus auricularis 

(B) STRAIN: ATCC 33753 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367 

GAACATAACG AAGTGCCTAA TATTAATAAC GCCTTAGTAC TCGATGTTGA 50 

AAGAGAAGAC GGAACAGTGT CTTTAACTTT AGAAGTAGCT TTACAATTAG 100 

GCGATGACGT TGTTCGTACC ATTGCAATGG ATTCAACTGA TGGTGTTAAA 150 

CGTGGTAACG AAGTCAAAGA TACTGGTAAT AGCATTAGCG TACCAGTCGG 200 

AGACGAAACT TTAGGACGTG TCTTCAACGT TCTAGGTGAA ACAATTGATT 2 50 

TAGAAGATAA ACTTGATGAT TCTGCGCGAC GTGACCCTAT ACATAGAGAA 3 00 

GCGCCAGCGT TTGATCAATT ATCAACTCAA GTTGAAATTT TAGAAACAGG 350 

AATTAAAGTT GTTGACTTAT TAGCACCTTA TATTAAAGGT GGTAAAGTTG 4 00 

GACTCTTCGG TGGTGC CGGT GTTGGTAAAA CCGTTTTAAT CCAAGAATTA 450 

ATCAACAACA TCGCTCAAGA ACACGGTGGT ATTTCAGTCT TTGCCGGTGT 500 

AGGTGAACGT ACACGTGAAG GTAACGACTT GTACTATGAA ATGAGCGACA 550 

GTGGTGTAAT CAAGAAAACA GCCATGGTCT TCGGACAAAT GAACGAACCA 600 

CCTGGCGCAC GTATGCGTGT TGCTTTATCT GGTTTAACAA TGGCTGAATA 650 

TTTCCGTGAT GAACAAGGAC AAGACGTATT GTTATTCATC GACAATATTT 700 

TCCGTTTCAC ACAAGCCGGT TCAGAAGTTT CTGCCTTACT AGGTCGTTTA 750 

CCATCAGCCG TTGGTTATCA ACCTACATTA GCAACAGAAA TGGGACAATT 800 

ACAAGAACGT ATTACTTCAA CAACAAAAGG ATCAGTTACT TCA 843 



2) INFORMATION FOR SEQ ID NO: 368 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus capitis subsp. capitis 

(B) STRAIN: ATCC 27840 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368 

GCTTTGAACA TAATGAAGTT CCTGATATTA ACAATGCCTT ACACATCGAA 50 

GTTCCTAAAG AAGATAGCAC ACTTCATTTA ACTTTAGAAG TTGCACTTCA 100 

ATTAGGTGAC GATGTAGTAC GTACAATCGC AATGGACTCA ACTGACGGCG 150 

TTCAAAGAGG TATGGAAGTT AAAGATACAG GTAAAGATAT TAGCGTACCT 2 00 

GTTGGTGATG CAACTTTAGG AAGAGTATTT AACGTATTAG GAGAAACAAT 250 

CGATTTAGAT GAAAAGATTG ATGATTCAGT ACGTCGTGAT CCTATTCATA 300 

GACAGGCACC TGGCTTCGAT GAATTATCTA CTAAAGTAGA AATCTTAGAA 3 50 
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ACAGGTATCA AAGTAGTAGA CTTATTAGCA 
AATTGGATTA TTCGGTGGTG CCGGTGTTGG 
AACTTATCAA TAATATCGCT CAAGAGCATG 
GGTGTTGGTG AACGTACACG TGAAGGTAAC 
CGATAGTGGT GTAATTAAGA AAACAGCGAT 
AGCCACCTGG TGCTCGTATG CGTGTTGCAT 
GAATATTTCC GTGATGAAGA AGG CCAAGAC 
TATCTTCAGA TTCACACAAG CTGGTTCTGA 
GTATGCCTTC AGCCGTTGGT TATCAACCAA 
CAATTACAAG AACGTATTAG TTCAACTAAT 



PCT/CA00/01150 

CCTTACATTA AAGGTGGTAA 4 00" 

TAAGACAGTT TTAATCCAAG 4 50 

GTGGTATTTC AGTATTCGCC 50 0 

GACCTTTACT ATGAAATGAG 550 

GGTATTCGGT CAGATGAACG 600 

TATCAGGTTT AACAATGGCA 650 

GTATTATTAT TCATTGATAA 700 

AGTTTCAGCA TTACTTGGAC 7 50 

CACTTGCTAC TGAAATGGGT 8 00 

AAAGGTTCTG TTACTTCAA 84 9 



2) INFORMATION FOR SEQ ID NO: 369 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 

(B) STRAIN: DSM 20260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369 

GAAGTTCCAG AAATTAATAA TGCCTTAGTT CTCGATATAG AAAGAGAAGA 50 

AGGTACTGTT GAATTAACGT TAGAAGTTGC ATTACAACTT GGTGATGACG 100 

TAGTACGAAC AATCGCAATG GATTCAACTG ATGGTGTTAA ACGTGGTACA 150 

GAAGTTAGAG ATAGCGGAAA TAGTATCAGC GTACCAGTTG GTAATGAAAC 2 00 

ATTAGGTAGA GTATTTAATG TATTAGGTGA TACGATAGAT TTAGATGAAG 250 

ACATAGATGA CTCAGTGCGT CGTGACCCAA TTCATAGAGA AGCACCTGCA 3 00 

TTTGATCAGT TATCTACTAA AGTTGAAATT TTAGAAACAG GTATCAAAGT 3 50 

CATTGATTTA TTAGCAC CAT ATATCAAAGG TGGTAAAGTT GGATTATTCG 4 00 

GTGGTGCCGG TGTTGGTAAA ACTGTATTAA TTCAAGAATT AATCAATAAT 4 50 

ATCGCTCAAG AGCATGGTGG TATATCCGTA TTTGCTGGTG TAGGTGAGCG 500 

TACGCGTGAA GGTAATGACC TATACTTTGA AATGAGTGAT AGTGGTGTTA 550 

TTAAAAAGAC AGCTATGGTA TTTGGACAAA TGAACGAACC ACCTGGTGCG 60 0 

CGTATGCGAG TAGCACTTTC TGGTTTAACA ATGGCTGAAT ATTTCCGGGA 650 

TGAACAAGGA CAAGATGTTC TATTATTCAT AGATAACATC TTTAGATTTA 700 

CTCAAGCTGG TTCAGAAGTT TCTGCGTTAT TAGGTCGTAT GCCTTCAGCT 750 

GTTGGTTACC AACCAACGTT AGCAACTGAA ATGGGACAAT TACAAGAACG 800 

TATTACTTCT ACAACTAAAG GTTCAGTAAC 83 0 



2) INFORMATION FOR SEQ ID NO: 370 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 787 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Staphylococcus epidermidTs 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370 

5 

AATAATGCAT TACACATCGA AGTTCCTAAA GAAGATGGAG CGCTTCAATT 50 

AACATTAGAA GTTGCACTTC AACTAGGTGA CGATGTAGTT CGTACAATTG 100 

CAATGGACTC AACTGACGGC GTTCAAAGAG GAATGGAAGT TAAAGATACA 150 

GGTAGAGACA TAAGTGTACC TGTCGGTGAC GTAACTCTAG GAAGAGTGTT 2 00 

10 TAACGTACTA GGAGAAACTA TTGACTTAGA TGAAAAAATT GATGATTCAG 2 50 

TACGACGTGA CCCTATCCAT AGACAAGCTC CAGGATTCGA CGAATTATCA 3 00 

ACAAAAGTAG AAATCTTAGA AACTGGTATT AAAGTAGTAG ACTTATTAGC 3 50 

ACCTTACATA AAAGGTGGTA AAATTGGATT ATTTGGTGGT GCCGGTGTAG 4 00 

GTAAAACCGT ACTAATCCAA GAACTTATTA ATAACATCGC TCAAGAACAC 4 50 

15 GGTGGTATCT CAGTATTCGC TGGTGTTGGT GAACGTACAC GTGAAGGTAA 500 

TGATCTTTAC TATGAAATGA GTGACAGTGG TGTTATCAAG AAAACTGCAA 550 

TGGTCTTTGG TCAAATGAAT GAGCCACCTG GTGCACGTAT GCGTGTAGCA 600 

TTATCCGGAT TAACAATGGC CGAATATTTC CGAGATGAAG AAGGCCAAGA 650 

TGTGTTATTA TTCATTGATA ACATTTTCAG ATTCACTCAA GCTGGTTCAG 7 00 

2 0 AAGTTTCTGC GTTATTAGGT CGTATGCCAT CAGCTGTTGG TTATCAACCT 750 

ACACTTGCTA CAGAAATGGG TCAATTACAA GAACGTA 7 87 



25 2) INFORMATION FOR SEQ ID NO: 371 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371 
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GAAGTACCTG AAATTAATAA CGCCTTAATC ATCGAAGTTC CCAAAGAAGA 50 

TGGTACTTTT GAATTAACGC TTGAAGTTGC ATTACAACTA GGTGATGACG 100 

TTGTTCGTAC AATTGCTATG GATT CAAC AG ATGGTGTTCA ACGTGGTATG 150 

GAAGTTCAGA ACACTGGAAA AGACATTTCA GTACCAGTTG GCGAAGTAAC 200 

45 TTTAGGACGT GTATTTAACG TATTAGGTGA CACAATTGAT TTAGAAGATA 2 50 

AATTAGATGG TTCAGTAAGA CGTGATCCAA TTCATAGACA ATCACCTAAC 3 00 

TTTGACGAAT TATCTACTGA AGTAGAAATT CTTGAAACTG GAATCAAAGT 3 50 

TGTAGACTTA TTAGCACCAT ACATCAAAGG TGGTAAAATC GGTCTATTTG 4 00 

GTGGTGCCGG TGTTGGTAAA ACCGTTTTAA TCCAAGAATT GATTAATAAT 4 50 

50 ATCGCACAAG AACATGGTGG TATCTCAGTA TTTGCTGGTG TAGGTGAACG 500 

TACACGTGAA GGTAACGACC TATATTATGA AATGAGAGAT AGTGGTGTTA 550 

TTAAGAAAAC AGCAATGGTA TTTGGTCAAA TGAACGAGCC ACCTGGTGCA 600 

CGTATGCGTG TGGCACTTTC TGCATTGACA ATGGCTGAGT ATTTCCGTGA 650 

TGAACAAGGA CAAGACGTTC TGTTATTCAT CGATAACATT TTCAGATTTA 7 00 

55 CTCAAGCAGG TTCAGAAGTA TCAGCATTAT TGGGACGTAT GCCTTCAGCT 750 

GTAGGTTATC AACCTACTTT AGCTACAGAA ATGGGTCAAT TACAAGAACG 800 

TATTACATCA ACGAATAAAG GTTCAGTAAC 83 0 

60 
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PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



10 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis subsp. hominis 

(B) STRAIN: ATCC 27844 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372 

TCGAACATAA TGAAGTCCCT GAAATTAATA ATGCCCTAAT TATTGAAGTA 50 

CCCAAAAATG ATGGCACATT TAAATTAACA TTAGAAGTTG CATTGCAACT 100 

AGGTGATGAT GTTGTTCGTA CTATTGCAAT GGATTCAACT GATGGTGTTC 150 

2 0 AACGTGGTAT GCAAGTTGTG AATACTGGTA AAGATATTAG TGTTCCTGTA 2 00 

GGTGAAGAAA CACTTGGACG TGTGTTTAAC GTTTTAGGAG AAACAATAGA 2 50 

TTTAAACGAA AAAATAGATA GTTCTGTTAG ACGTGATCCA ATTCATCGTC 300 

GTCAACCTAA TTTTGATGAA TTATCTACTG AAGTAGAAAT TCTTGAAACA 3 50 

GGTATTAAAG TTGTAGACTT ATTAGCACCT TATATTAAAG GTGGTAAGAT 4 00 

25 TGGTTTATTC GGTGGTGCCG GCGTAGGTAA AACTGTATTA ATTCAAGAAT 4 50 

TAATCAATAA TATCGCTCAA GAACATGGTG GTATTTCTGT ATTCGCTGGT 50 0 

GTAGGTGAAC GTACTCGTGA AGGTAACGAT TTATACTATG AAATGAGCGA 550 

TAGTGGCGTT ATCAATAAAA CAGCCATGGT ATTTGGGCAA ATGAATGAGC 600 

CGCCAGGTGC GCGTATGCGT GTTGCTTTAT CAGCATTGAC AATGGCTGAA 650 

3 0 TATTTCCGTG ATGAACAAGG TCAAGATGTA CTTTTATTCA TTGACAATAT 700 

TTTCCGCTTT ACTCAAGCTG GTTCTGAAGT TTCAGCATTA TTAGGACGTA 750 
TGCCTTCAGC TGTAGGTTAT CAACCTACAT TAGCAACTGA AATGGGTCAA 80 0 

TTACAAGAAC GTATTACATC TACTAATAAA GGTTCAGTCA CTTCAA 846 

35 

2) INFORMATION FOR SEQ ID NO: 373 

(i) SEQUENCE CHARACTERISTICS: 

4 0 (A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus hominis 

(B) STRAIN: CSG 175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373 
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TCGAACATAA TGAAGTCCCT GAAATTAATA ATGCCCTAAT TATTGAAGTA 50 

CCCAAAAATG ATGGCACATT CAAATTAACA TTAGAAGTTG CATTGCAACT 100 

55 AGGTGATGAT GTTGTTCGTA CTATTGCAAT GGATTCAACT GATGGTGTTC 150 

AACGTGGTAT GCAAGTTGTG AATACTGGTA AAGATATTAG TGTTCCTGTA 2 00 

GGTGAAGAAA CACTTGGACG TGTGTTTAAC GTTTTAGGAG AAACAATAGA 250 

TTTAAACGAA AAAATAGATA GTTCTGTTAG ACGTGATCCA ATTCATCGTC 300 

GTCAACCTAA TTTTGATGAA TTATCTACTG AAGTAGAAAT TCTTGAAACA 350 

60 GGTATTAAAG TTGTAGACTT ATTAGCACCT TATATTAAAG GTGGTAAGAT 4 00 
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TGGTTTATTC GGTGGTGCCG 
TAATCAATAA TATCGCTCAA 
GTAGGTGAAC GTACTCGTGA 
TAGTGGCGTT ATCAATAAAA 
CGCCAGGTGC GCGTATGCGT 
TATTTCCGTG ATGAACAAGG 
TTTCCGCTTT ACTCAAGCTG 
TGCCTTCAGC TGTAGGTTAT 
TTACAAGAAC GTATTACATC 



GCGTAGGTAA 
GAACATGGTG 
AGGTAACGAT 
CAGCCATGGT 
GTTGCTTTAT 
TCAAGATGTA 
GTTCTGAAGT 
CAACCTACAT 
TACTAATAAA 



AACTGTATTA ATTCAAGAAT 
GTATTTCTGT ATTCGCTGGT 
TTATACTATG AAATGAGCGA 
ATTTGGGCAA ATGAATGAGC 
CAGCATTGAC AATGGCTGAA 
CTTTTATTCA TTGACAATAT 
TTCAGCATTA TTAGGACGTA 
TAGCAACTGA AATGGGTCAA 
GGTTCAGTCA CTTCAA 



450 
500 
550 
600 
650 
700 
750 
800 
846 



2) INFORMATION FOR SEQ ID NO: 374 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus lugdunensis 

(B) STRAIN: ATCC 43 80 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374 

ATAATGAAGT GCCTGAAATA AATAATGCGC TCATTGTTGA AATTCCTAAA 50 

AGTGATACAA CAATCAGTTT AACACTTGAA GTTGCTTTGC AATTAGGTGA 100 

CGATGTTGTA CGTACTATTG CAATGGATTC AACTGATGGC GTTCAACGTG 150 

GTATGGAAGT TCAAAACACA GGTAAAGACA TCAGTGTACC TGTTGGAGAT 2 00 

GAAACATTAG GAAGAGTATT TAACGTTTTA GGAGAATCTA TTGATTTAGA 2 50 

AGAAAAGCTA GATGACTCTG TGCGTAGAGA TCCAATTCAT AGACTAGCAC 3 00 

CTAAATTTGA TGAATTATCT ACAGAAGTAG AAATTCTTGA AACTGGTATT 3 50 

AAAGTTGTTG ATTTATTAGC ACCATATATT AAAGGTGGTA AAGTTGGATT 4 00 

GTTTGGTGGT GCCGGAGTAG GTAAAACGGT ATTAATTCAA GAATTAATCA 4 50 

ACAATATTGC TCAAGAACAT GGTGGTATTT CTGTGTTTGC CGGAGTAGGT 500 

GAACGTACAC GTGAAGGTAA TGACTTATAT TATGAAATGA GCGATAGTGG 550 

CGTAATTAAG AAAACAGCGA TGGTATTTGG CCAAATGAAT GAACCACCTG 600 

GTGCACGTAT GAGAGTTGCG TTATCTGCCT TAACAATGGC TGAATATTTC 650 

CGTGACGAGC AAGGACAAGA CGTATTGCTG TTTATCGATA ATATATTCCG 700 

TTTTACACAA GCAGGTTCAG AAGTATCTGC ATTACTTGGA CGTATGCCAT 750 

CTGC CGTTGG TTATCAACCA ACATTGGCTA CAGAAATGGG ACAATTGCAA 800 

GAAAGAATTA CATCTACAAA TAAAGGTTCT GTAAC 835 



2) INFORMATION FOR SEQ ID NO : 3 75 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 
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STRAIN: ATCC 153 05 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375 



5 GAGCACAATG 
AAGAGATGAA 
GCGATGATGT 
CGTGGTACAG 
TGATGCTACG 

10 TAGACGAGAA 
GCACCTGCAT 
TATTAAAGTA 
GTTTATTCGG 
ATTAATAATA 

15 AGGTGAACGT 
GTGGTGTTAT 
CCTGGTGCGC 
CTTCCGTGAT 
TCAGATTTAC 

20 CCATCAGCCG 
ACAAGAACGT 



AAGTTCCAGA 
GGTACAGTAT 
CGTACGTACA 
AAGTTCGAGA 
TTAGGACGTG 
GCTTGATACT 
TCGATCAATT 
ATTGATTTAC 
TGGCGCTGGT 
TAGCTCAAGA 
ACGCGTGAAG 
TAAGAAAACA 
GTATGCGTGT 
GTACAAGGAC 
GCAAGCTGGT 
TTGGTTATCA 
ATTACATCAA 



AATTAACAAT 
CTCTTACATT 
ATTGCAATGG 
TAGCGGAGAT 
TGTTTAATGT 
TCTGTCAAAC 
ATCAACAAAA 
TTGCACCATA 
GTAGGTAAAA 
ACATGGTGGT 
GTAATGACTT 
GCTATGGTCT 
TGCTTTATCA 
AAGATGTTTT 
TCAGAAGTAT 
ACCTACCCTT 
CAACTAAAGG 



GCCTTAGTCG 
AGAAGTGGCA 
ATTCTACTGA 
AGCATCAGTG 
TCTTGGTGAT 
GTGATCCAAT 
GTTGAAATCT 
TATTAAAGGT 
CAGTATTAAT 
ATTTCAGTAT 
ATACTACGAA 
TCGGACAAAT 
GGCTTAACAA 
ACTATTTATT 
CAGCACTATT 
GCTACTGAAA 
ATCTGTAACG 



TAGACGTTGA 50 

TTACAACTTG 100 

TGGTGTTAAA 150 

TTCCAGTTGG 2 00 

ACAATTGACT 2 50 

TCATAGAGAA 3 00 

TAGAAACAGG 3 50 

GGTAAAATCG 4 00 

TCAAGAATTA 450 

TTGCCGGCGT 500 

ATGAGTGATA 550 

GAATGAGCCA 600 

TGGCTGAACA 650 

GATAACATAT 7 00 

AGGTCGTATG 750 

TGGGTCAATT 800 

TC 842 



2 5 2) INFORMATION FOR SEQ ID NO : 3 76 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 842 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus simulans 

(B) STRAIN: ATCC 27848 



40 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376 



TGATGAACTG CCTAAGATTA ATAACGCATT AGTGCTAGAT GTACCTAAGA 50 

AAGATGGCAC GACTGAATCT CTTACATTAG AAGTAGCACT TGAATTAGGC 10 0 

GACGACGTAG TTAGAACTAT CGCCATGGAC TCTACAGACG GAATTAAACG 150 

TGGTGACGAC GTTAAAGACA CTGGTCGTCC AATCAGTGTA CCTGTCGGTG 2 00 

45 AAGATACGTT AGGAAGAGTA TTTAACGTTT TAGGTGATCC AATCGATAAT 250 

GATGGACCGA TTTCTGAATC AGTTCCACGT GAACCAATTC ATAGACAACC 3 00 

ACCTAAATTT GATGAATTAT CAACAAAAGT TGAACTACTT GAAACTGGTA 3 50 

TCAAAGTAGT AGACTTATTA GCACCATATA TCAAAGGTGG TAAAGTTGGT 40 0 

TTATTCGGTG GTGCCGGAGT AGGTAAAACT GTATTAATCC AAGAATTAAT 4 50 

50 TAATAACATC GCTCAAGAAC ACGGCGGTAT TTCAGTATTC GCAGGTGTTG 50 0 

GTGAACGTAC ACGTGAAGGT AACGACTTGT ACTTCGAAAT GAGCGACAGT 550 

GGTGTTATCA AGAAAACAGC GATGGTATTC GGACAAATGA ACGAACCACC 600 

TGGTGCACGT ATGCGTGTAG CTTTATCAGG TTTAACAATG GCTGAATACT 650 

TCCGTGATGT TAAAGGACAA GACGTTCTTT TATTCATCGA TAACATTTTC 7 00 

55 CGCTTCACAC AAGCAGGTTC TGAGGTATCA GCATTGCTTG GCCGTATGCC 750 

ATCAGCCGTT GGTTACCAAC CAACATTGGC AACAGAAATG GGTCAATTAC 800 

AAGAACGTAT CACTTCTACA ATGAAAGGTT CTATCACATC TA 842 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warneri 

(B) STRAIN: ATCC 27836 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377 

CATAACGAAG TCCCTGATAT TAATAATGCC CTTATTATTG AAGTTCCAAA 50 

AGAAGATGGA ACGTTAAACT TAACATTAGA AGTTGCACTA CAATTAGGTG 10 0 

ATGATGTTGT ACGTACAATT GCAATGGATT CAACTGATGG TGTTCAAAGA 150 

20 GGCATGGATG TTAAAGACAC AGGTAAAGAT ATTAGTGTAC CTGTAGGCGA 2 00 

TGAAACGCTT GGAAGAGTGT TTAATGTACT AGGTGAAACA ATTGACTTGG 2 50 

AAGAGAAAAT TGATGATTCC GTACGTCGTG ATCCAATCCA TAGACAATCA 3 00 

CCAGGTTTCG ATGAATTATC TACTGAAGTA GAAATCTTAG AAACAGGTAT 350 

TAAAGTAGTA GACTTATTAG CACCTTACAT TAAAGGTGGT AAAGTTGGAC 400 

25 TATTCGGTGG TGCCGGAGTA GGTAAAACCG TTTTAATCCA AGAATTAATT 4 50 

AACAATATTG CACAAGAACA TGGTGGTATT TCAGTATTCG CGGGTGTAGG 500 

TGAACGTACT CGTGAAGGTA ATGATTTATA CTATGAAATG AGTGATAGTG 550 

GTGTAATTAA GAAAACAGCG ATGGTATTTG GACAAATGAA TGAACCACCT 600 

GGCGCACGTA TGCGTGTAGC TTTATCTGGT TTAACTATGG CTGAATACTT 650 

3 0 CCGTGATGAA CAAGGACAAG ACGTACTTTT ATTCATCGAT AATATTTTCA 70 0 

GATTTACACA AGCTGGTTCT GAAGTTTCTG CATTACTTGG TCGTATGCCT 75 0 

TCAGCCGTTG GTTACCAACC AACATTAGCA ACTGAAATGG GTCAATTACA 800 

AGAACGAATT ACATCTACAA ATAAAGGTTC TGTAACATCT A 841 



35 



2) INFORMATION FOR SEQ ID NO: 378 

(i) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 (ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus acidominimus 

(B) STRAIN: ATCC 51726 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378 



TTTAACACGA ATGAACCGCT TCCTGAGATA AATAATGCAC TTGTTGTTTA 50 

CAAAGACAGT GAGAAAAAAC ATAAAATCGT TCTTGAAGTA GCTCTTGAAC 100 

55 TTGGTGAAGG CCTCGTTCGT ACCATTGCTA TGGAATCAAC TGATGGTTTG 150 

ACACGTGGTC TAGAAGTTCT TGATACAGGC CGTGCAATCA GTGTACCAGT 2 00 

TGGTAAAGAA ACGCTTGGAC GTGTCTTCAA CGTTCTTGGT GATGCTATCG 2 50 

ATCTTGAAGA ACCATTTGGA GAAGATGCAG AACGTCACCC CATTCATAAG 3 00 

AGTGCCCCAA CTTTTGATGA ATTATCAACG TCAACAGAAA TCCTTGAAAC 3 50 

60 AGGGATTAAA GTTATCGACC TACTTGCCCC TTACTTAAAA GGAGGGAAGG 4 00 
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TTGGACTTTT CGGTGGTGCC 
TTGATTCATA ACATTGCTCA 
AGTTGGTGAA CGTACACGTG 
AATCAGGCGT TATTGAAAAA 
CCACCTGGTG CACGTATGCG 
ATATTTCCGT GATGTTGAAG 
TTTTTCGTTT CACACAAGCA 
ATGCCATCAG CCGTTGGTTA 
ATTGCAAGAA CGTATCACGT 



GGAGTTGGTA 
AGAGCATGGT 
AAGGTAATGA 
ACAGCTATGG 
TGTAGCCCTT 
GACAGGACGT 
GGTTCTGAAG 
TCAACCAACC 
CAACTAAAAA 



AGACCGTTCT TATCCAAGAG 
GGTATTTCAG TATTTACCGG 
CCTCTATTGG GAAATGAAAG 
TATTTGGTCA GATGAATGAG 
ACTGGTTTGA CAATCGCTGA 
GCTTCTCTTT ATTGATAACA 
TTTCAGCTCT TCTTGGACGT 
TTGGCAACTG AAATGGGTCA 
AGGTTCTGTT ACATCA 



45T) 
500 
550 
600 
650 
700 
750 
800 
846 



2) INFORMATION FOR SEQ ID NO: 379 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactia^ 

(B) STRAIN: ATCC 12403 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 379 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTA CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 2 00 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 2 50 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACA 3 50 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 4 00 

TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 4 50 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 6 00 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCAA 846 



2) INFORMATION FOR SEQ ID NO: 380 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 846 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 
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(B) STRAIN: ATCC 13813 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380 

5 TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 200 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

10 TCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 3 50 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 400 

TGGACTTTTC GGTGGTGCGG GTGTTGGTAA AACCGTTCTT ATT CAAGAAT 450 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

15 GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

2 0 TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 80 0 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCAA 846 



25 2) INFORMATION FOR SEQ ID NO: 3 81 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agralactiae 

(B) STRAIN: ATCC 12 973 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381 



TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GTGCAATTAG TGTGCCGGTT 200 

45 GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAGCACCATC ATTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACA 350 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 4 00 

TGGACTTTTC GGTGGTGCGG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 45 0 

50 TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 600 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

55 CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 800 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CCTCA 845 



60 
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2) INFORMATION FOR SEQ ID NO: 382 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 27591 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382 

TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 5 0 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

2 0 CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 2 00 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 250 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 3 50 

GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 4 00 

2 5 TGGACTTTTC GGTGGTGCAG GTGTTGGTAA AACCGTTCTT ATTCAAGAAT 4 50 

TAATCCACAA CATCGCCCAA GAACATGGTG GTATTTCAGT ATTTACTGGT 500 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC CTTTATTGGG AAATGAAAGA 550 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT CTTTGGTCAA ATGAATGAAC 60 0 

CACCAGGAGC ACGTATGCGT GTGGCACTTA CTGGTCTTAC AATAGCTGAG 650 

3 0 TACTTCCGTG ATGTAGAAGG ACAAGATGTG CTTCTCTTCA TTGATAATAT 700 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT GTCAGCGCTT TTAGGTCGTA 750 
TGCCTTCAGC CGTTGGTTAT CAACCAACAC TTGCTACAGA AATGGGACAA 8 00 

TTACAAGAGC GTATCACTTC AACTAAAAAA GGTTCTGTTA CATCA 84 5 



35 
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2) INFORMATION FOR SEQ ID NO: 3 83 



( i ) SEQUENCE CHARACTERI STICS : 
4 0 (A) LENGTH: 84 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: CDC SS1073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383 



TTGCAAGTGG CGACAAACTT CCTGAGATTA ATAATGCATT GATTGTCTAT 50 

AAAAATGGCG ATAAGTCACA AAAAGTAGTA CTTGAAGTTG CTCTTGAACT 100 

55 TGGTGACGGC CTCGTTCGTA CAATCGCTAT GGAATCAACT GATGGGCTTA 150 

CACGTGGTTT GGAAGTATTA GATACTGGTC GCGCAATTAG TGTGCCGGTT 2 00 

GGTAAGGATA CTTTGGGTCG TGTCTTCAAC GTTCTTGGAG ATGCTATTGA 2 50 

CCTTGAAGAG CCTTTTGCAG AAGATGCAGA ACGTCAACCA ATCCATAAAA 3 00 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT CTTAGAAACT 3 50 

60 GGTATTAAAG TTATTGACTT ATTAGCACCT TACTTAAAAG GTGGTAAAGT 4 00 
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TGGACTTTTC GGTGGTGCAG GTGTTGGTAA 

TAATCCACAA CATCGCCCAA GAACATGGTG 

GTAGGAGAAC GTACTCGTGA AGGGAATGAC 

ATCTGGCGTT ATTGAAAAAA CGGCTATGGT 

5 CACCAGGAGC ACGTATGCGT GTGGCACTTA 

TACTTCCGTG ATGTAGAAGG ACAAGATGTG 

CTTCCGTTTC ACACAAGCTG GGTCAGAAGT 

TGCCTTCAGC CGTTGGTTAT CAACCAACAC 

TTACAAGAGC GTATCACTTC AACTAAAAAA 



PCT/CA00/01150 

AACCGTTCTT ATTCAAGAA^ 450 

GTATTTCAGT ATTTACTGGT 500 

CTTTATTGGG AAATGAAAGA 550 

CTTTGGTCAA ATGAATGAAC 600 

CTGGTCTTAC AATAGCTGAG 650 

CTTCTCTTCA TTGATAATAT 700 

GTCAGCGCTT TTAGGTCGTA 750 

TTGCTACAGA AATGGGACAA 800 

GGTTCTGTTA CATCA 845 



2) INFORMATION FOR SEQ ID NO: 384 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus dysgalactiae 
25 (B) STRAIN: ATCC 43078 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384 

TTGCTAGTGG GGACAAACTT CCAGAGATTA ATAATGCATT GATTGTTTAT 50 

AAAGATAGTG ATAAAAAGCA AAAAATCGTC CTTGAAGTTG CTCTGGAACT 100 

TGGTGACGGT ATGGTGCGAA CAATCGCTAT GGAATCAACT GATGGGCTTA 15 0 

CACGTGGGTT AGAAGTTCTT GACACTGGTC GTGCGATTAG TGTACCAGTA 200 

GGTAAAGAAA CTTTGGGACG CGTCTTTAAT GTACTTGGAG AAACCATTGA 2 50 

CTTGGAAGAA CCATTTGCAG AAGACGTTGA CCGTCAGCCA ATCCATAAAA 300 

AAGCACCATC GTTTGATGAA TTATCAACAT CATCAGAAAT TCTTGAAACT 350 

GGTATCAAGG TAATTGACCT TCTTGCCCCT TACCTTAAAG GTGGTAAAGT 4 00 

TGGACTTTTC GGGGGTGCCG GAGTTGGTAA GACTGTCCTT ATCCAAGAAT 4 50 

TAATCCACAA TATCGCCCAA GAACACGGAG GTATTTCAGT ATTTACCGGT 500 

GTTGGTGAGC GAACACGTGA AGGAAATGAC CTTTACTGGG AAATGAAAGA 550 

ATCAGGCGTT ATTGAGAAAA CTGCCATGGT TTTTGGTCAG ATGAATGAGC 600 

CGCCTGGGGC ACGTATGCGT GTAGCCCTTA CTGGTTTAAC CATTGCTGAG 650 

TATTTCCGTG ATGTAGAAGG CCAAGATGTT TTGCTCTTTA TTGATAATAT 70 0 

CTTCCGTTTC ACTCAGGCAG GTTCAGAAGT ATCAGCCCTC TTAGGCCGTA 750 

TGCCTTCTGC TGTTGGTTAC CAACCGACCC TTGCTACTGA AATGGGACAA 800 

TTGCAAGAAC GTATTACGTC AACTCAAAAA GGATCTGTTA CTTCT 845 



50 



2) INFORMATION FOR SEQ ID NO: 385 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Streptococcus equi subsp. egui 
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(B) STRAIN: ATCC 9528 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385 

5 TTGCGAGTGG GGACAAACTA CCAGAGATTA ATAATGCGTT GATAGTTTAT 50 

AAAGATGGCG ATAAAAAGCA AAAAATCGTT CTCGAGGTTG CCCTAGAGCT 100 

TGGAGACGGT ATGGTACGTA CAATTGCTAT GGAATCAACC GATGGGCTTA 150 

CACGTGGATT AGAGGTTCTT GATACTGGTC GTGCCATTAG TGTACCAGTT 2 00 

GGTAAAGAGA CTCTAGGTCG TGTTTTCAAC GTTCTTGGTG AAACCATCGA 2 50 

10 CCTAGAAGCA CCATTTGCAG ATGATGTTAA TCGTGAACCG ATCCATAAAA 3 00 

AAGCACCAGC CTTTGATGAA TTGTCAACAT CATCAGAAAT TCTTGAAACA 3 50 

GGTATCAAGG TTATTGACCT GCTTGCCCCT TACTTAAAGG GTGGTAAGGT 4 00 

CGGTCTTTTC GGTGGTGCCG GAGTTGGTAA AACCGTTCTT ATCCAAGAAT 4 50 

TAATCCACAA TATCGCTCAA GAGCATGGTG GGATCTCGGT ATTTACCGGT 50 0 

15 GTTGGTGAGC GTACGCGTGA AGGAAATGAC CTTTACTGGG AAATGAAGGA 550 

ATCAGGCGTT ATTGAAAAAA CAGCCATGGT TTTTGGTCAG ATGAATGAAC 6 00 

CACCAGGAGC CCGTATGCGT GTTGCCTTGA CCGGCTTGAC AATTGCTGAA 650 

TATTTCCGCG ATGTTGAAGG CCAAGACGTC CTGCTCTTCA TTGACAATAT 70 0 

TTTCCGCTTT ACTCAAGCAG GCTCAGAGGT ATCAGCCCTT CTAGGTCGTA 750 

2 0 TGCCTTCAGC CGTTGGTTAC CAGCCAACAC TTGCCACTGA AATGGGACAA 8 00 

TTGCAAGAGC GTATCACCTC AACGAAAAAA GGCTCTGTTA CCTCTA 84 6 



25 2) INFORMATION FOR SEQ ID NO: 386 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 84 6 bases 
<B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus anginosus 

(B) STRAIN: ATCC 27335 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386 



TTGCAGCTGG TGATAAATTA CCTGAGATAA ATAATGCACT TGTAGTCTAT 50 

AAAAATGACG AAAATAAATC AAAAATCGTC CTTGAAGTAG CTCTTGAGCT 10 0 

TGGTGATGGA GTGGTTCGAA CTATTGCCAT GGAATCCACT GATGGGTTGA 150 

CTCGTGGCAT GGAAGTTCTA GATACTGGTC GACCAATTTC TGTTCCAGTT 200 

45 GGGAAAGAAA CACTTGGTCG CGTCTTTAAC GTTTTAGGCG ATACCATTGA 25 0 

TTTGGATACT CCATTCGGCG AAGATGCAGA ACGTCAACCA ATCCATAAAA 300 

AAGCTCCAAC TTTTGATGAG TTATCTACTT CATCAGAAAT CTTAGAAACA 3 50 

GGAATAAAGG TTATTGACCT TTTAGCCCCC TACCTCAAAG GTGGGAAAGT 4 00 

CGGCCTCTTC GGTGGTGCTG GCGTTGGGAA AACTGTCTTG ATTCAAGAGT 450 

50 TGATTCATAA TATCGCCCAA GAACACGGCG GGATTTCAGT CTTTACTGGT 500 

GTTGGGGAAC GAACTCGTGA AGGGAATGAC CTGTACTGGG AAATGAAAGA 550 

ATCTGGTGTT ATCGAAAAGA CGGCTATGGT CTTTGGGCAA ATGAATGAAC 600 

CGCCTGGAGC ACGTATGCGT GTAGCTTTGA CTGGGTTAAC GATTGCAGAG 650 

TATTTCCGTG ATGTGGAAGG TCAAGATGTT CTTTTGTTTA TTGATAATAT 700 

55 TTTCCGTTTC ACTCAAGCTG GTTCTGAAGT GTCAGCCCTT CTTGGTCGTA 750 

TGCCATCAGC TGTTGGTTAC CAACCAACCT TGGCTACTGA AATGGGGCAA 800 

TTACAAGAAC GTATTACATC AACGAAAAAA GGTTCTGTTA CCTCAA 846 



60 
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PCT/CA00/01150 



10 



15 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus salivarius 

(B) STRAIN: ATCC 7073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387 



GCAGCTGGTG 
TGATGAACAA 
GAGAAGGTGT 

2 0 CGTGGACTAG 

TAAAGATACC 
TGGAAGCACC 
GCACCAACTT 
GATTAAAGTT 
25 GACTCTTCGG 
ATTCACAACA 
TGGTGAACGT 
CTGGCGTTAT 
CCTGGAGCAC 

3 0 CTTCCGTGAT 

TCCGTTTCAC 
CCATCAGCCG 
GCAAGAACGT 



ATAAACTTCC 
AAGTCTAAAC 
GGTTCGTACC 
AAGTTCTGGA 
CTTGGACGTG 
TTTTGCAGAC 
TCGATGAATT 
ATCGACTTGC 
TGGTGCCGGT 
TTGCCCAAGA 
ACACGTGAAG 
CGAGAAAACA 
GTATGCGTGT 
GTCGAGGGTC 
TCAAGCAGGT 
TTGGTTACCA 
ATCACATCAA 



TGAGATTAAC 
GTATCGTGCT 
ATTGCCATGG 
CACTGGTCGT 
TCTTTAACGT 
GATGCAGAGC 
GTCAACATCT 
TAGCCCCTTA 
GTTGGTAAAA 
GCACGGTGGT 
GTAATGACCT 
GCCATGGTCT 
TGCCCTTACT 
AAGACGTTCT 
TCTGAGGTTT 
ACCTACACTT 
CTAAAAAAGG 



AATGCATTGG 
CGAAGTAGCT 
AATCTACTGA 
CCAATCAGCG 
TCTTGGTGAT 
GTGAACCAAT 
ACTGAAATCC 
CCTTAAGGGT 
CCGTTCTTAT 
ATTTCCGTGT 
TTACTGGGAA 
TCGGTCAAAT 
GGTTTGACAA 
TCTCTTCATC 
CTGCCCTTCT 
GCTACTGAAA 
TTCTGTTACA 



TCGTTTATAC 50 

CTTGAACTTG 100 

TGGATTGACT 150 

TTCCTGTTGG 2 00 

ACCATTGACT 2 50 

TCACAAAAAA 3 00 

TTGAAACAGG 3 50 

GGTAAAGTCG 4 00 

TCAAGAGTTG 4 50 

TTACAGGTGT 500 

ATGAAAGAAT 550 

GAACGAACCA 60 0 

TTGCGGAATA 650 

GATAACATCT 700 

TGGTCGTATG 750 

TGGGTCAATT 800 

TCT 843 



35 



2) INFORMATION FOR SEQ ID NO: 388 

(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus suis 

(B) STRAIN: ATCC. 43765 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388 



TTGCAGCAGA 
AAAAATGATG 

55 TGGTGATGGC 
CACGTGGGAT 
GGTAAAGAAA 
CCTTGAAGAG 
AAGCGCCGGC 

60 GGGATTAAGG 



AGATAAACTT 
ATTCCAAACA 
GTTGTACGGA 
GGAAGTTCTC 
CGCTGGGTCG 
TCTTTTCCGG 
TTTTGACGAA 
TTATCGACCT 



CCTGAGATTA ACAACGCACT 
AAAAGTCGTG CTTGAAGTGG 
CCATTGCCAT GGAATCAACG 
GATACAGGTC GTCCCATCTC 
TGTCTTCAAT GTGTTGGGAG 
CAGATTTTGA ACGTGAGCCT 
TTATCTACTT CAAGCGAAAT 
CCTAGCACCT TATCTAAAAG 
232 



CGTTGTATAT 50 

CTTTGGAACT 100 

GATGGATTGA 150 

TGTTCCAGTC 2 00 

ATACCATTGA 250 

ATCCATAAGA 3 00 

TTTGGAAACA 3 50 

GTGGTAAGGT 4 00 
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TGGTCTCTTC GGTGGTGCTG GTGTTGGTAA 

TGATTCACAA TATTGCCCAA GAACACGGTG 

GTTGGCGAGC GTACCCGTGA AGGGAACGAT 

ATCAGGTGTT ATTGAAAAAA CGGCCATGGT 

5 CACCAGGAGC CCGTATGCGT GTTGCTCTTA 

TACTTCCGTG ATGTGGAAGG GCAGGATGTT 

CTTCCGTTTC ACACAGGCTG GTTCAGAAGT 

TGCCATCAGC CGTTGGTTAT CAGCCAACAC 

TTGCAGGAGC GTATTACCTC AACCAAGAAG 



PCT/CAOO/01150 

AACCGTTCTT ATCCAAGAAT 4 50 

GTATCTCTGT ATTTACCGGA 50 0 

CTTTACTGGG AAATGAAAGA 550 

ATTTGGTCAG ATGAATGAGC 600 

CTGGTTTGAC TATTGCGGAA 650 

CTTCTGTTCA TCGATAATAT 700 

GTCTGCCCTC TTGGGTCGTA 750 

TTGCGACGGA GATGGGACAA 800 

GGTTCTGTTA C 841 



2) INFORMATION FOR SEQ ID NO: 389 

15 ( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus uberis 
25 (B) STRAIN: ATCC 19436 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389 

GCAAACGGTG AAAAATTACC AGAGATTAAT AATGCATTGA TAGTTTATAA 50 

3 0 AGGTAGCGAT AAAAAACAAA AGATTGTTCT TGAAGTTGCT TTGGAACTTG 10 0 

GGGACGGAAT GGTTCGTACA ATCGCTATGG AATCAACTGA TGGGCTTACA 15 0 

CGTGGATTAG AAGTTTTAGA TACTGGCCGT GCCATTAGTG TACCAGTCGG 200 

AAAAGAAACT TTGGGTCGTG TTTTCAATGT GCTTGGTGAA ACCATTGATT 250 

TGGATGAAC C ATTTGCCGCT GATGCTGCAA GAGAACCCAT CCATAAAAAA 3 00 

3 5 GCCCCAGCAT TTGATGAACT ATCAACGTCT TCAGAAATTC TTGAAACCGG 350 

AATAAAAGTT ATTGACTTAT TAGCCCCTTA TCTCAAAGGT GGTAAAGTTG 4 00 

GTTTATTTGG TGGTGCCGGA GTAGGTAAAA CGGTTTTAAT TCAAGAATTA 4 50 

ATTCATAATA TTGCACAAGA ACATGGTGGT ATTTCAGTAT TTACCGGTGT 500 

TGGTGAAAGA ACTCGTGAAG GTAATGACCT TTATTGGGAA ATGAAAGAAT 550 

4 0 CTGGCGTTAT TGAAAAAACA GCCATGGTAT TTGGACAAAT GAACGAACCA 600 

CCAGGAGCAC GTATGCGCGT TGCTTTAACA GGTTTAACCA TTGCTGAATA 650 

TTTCCGGGAT GTTGAAGGTC AAGATGTTTT GCTCTTTATT GACAACATTT 700 

TCCGTTTCAC GCAAGCTGGT TCAGAAGTTT CAGCCCTATT GGGTCGTATG 750 

CCTTCAGCGG TAGGATACCA ACCAACACTT GCTACCGAAA TGGGACAATT 800 

45 GCAAGAAAGA ATTACCTCAA CTAACAAGGG ATCTGTTACT TCTA 844 



50 



2) INFORMATION FOR SEQ ID NO: 390 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 96 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Tatumella ptyseos 
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(B) STRAIN: ATCC 333 01 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390 

5 TTCCCTCAGG ACGCTGTACC ACAGGTGTAC AACGCTCTTG AGGTTGAAAA 50 

TGGTGATACC CGTCTGGTGC TGGAAGTTCA GCAGCAGCTG GGCGGTGGTG 100 

TCGTTCGTAC GATTGCAATG GGAACCTCTG ACGGCCTGAA ACGTGGCCTT 150 

AAGGTGACCG ATCTGCAAAA ACCGATTCAG GTACCGGTCG GTAAAGCGAC 2 00 

GCTGGGCCGT ATCATGAACG TACTGGGTCA GCCAATCGAT ATGAAAGGCG 250 

10 ACCTGAAGAA CGAAGATGGT AGCAATGTTG AGGTGAACTC TATTCACCGT 300 

GCAGCGCCAA GCTACGAAGA ACTGGCTAAC TCTACTGAGC TGCTGGAAAC 350 

GGGTATCAAG GTTATCGACC TGATCTGTCC GTTTGCAAAA GGCGGTAAAG 4 00 

TGGGTCTGTT CGGTGGTGCG GGTGTAGGTA AGACCGTCAA CATGATGGAA 450 

CTGATCCGTA ACATCGCTAT CGAGCACTCT GGTTACTCTG TATTTGCAGG 50 0 

15 GGTGGGTGAG CGTACCCGTG AAGGTAACGA CTTCTACCAC GAAATGACCG 550 

AGTCTAACGT TCTGGATAAA GTTGCTCTGG TTTATGGCCA GATGAACGAG 600 

CCACCAGGAA ACCGTCTGCG CGTTGCGCTG ACCGGTCTGA CTATGGCTGA 650 

AAAATTCCGT GACGAAGGCC GTGACGTACT GCTGTTCGTT GATAACATCT 70 0 

ATCGTTATAC CCTGGCCGGT ACTGAAGTTT CAGCACTGCT GGGTCGTATG 750 

2 0 CCTTCTGCGG TAGGTTATCA GCCAACACTG GCCGAAGAAA TGGGTGTTCT 800 

TCAGGAACGT ATCACGTCAA CCAAAACCGG TTCAATCACT TCCGTA 896 



25 2) INFORMATION FOR SEQ ID NO: 391 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trabulsiella guamensls 

(B) STRAIN: ATCC 49490 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391 



TTCCCTCAGG ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTTATGAA 50 

TGGTAGTGAG CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGTGGTGGTA 10 0 

TCGTACGTAC TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 150 

GATGTAAAAG ATCTCGAGCA TCCGATCGAA GTCCCGGTAG GTAAAGCAAC 2 00 

4 5 GCTGGGTCGT ATCATGAACG TGCTGGGTCA GCCGATCGAT ATGAAAGGCG 2 50 

ACATCGGCGA AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCGTCCTAC 300 

GAAGAGCTGT CCAGCTCTCA GGAACTGCTG GAAACCGGCA TCAAAGTTAT 3 50 

CGACCTGATG TGTCCGTTCG CGAAGGGCGG TAAAGTCGGT CTGTTCGGTG 4 00 

GTGCGGGTGT AGGTAAAACC GTAAACATGA TGGAGCTGAT TCGTAACATC 450 

50 GCGATCGAGC ACTCCGGTTA CTCTGTGTTT GCGGGCGTGG GTGAACGTAC 500 

TCGTGAGGGT AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTCTGG 550 

ACAAAGTATC CCTGGTGTAT GGACAGATGA ACGAGCCGCC GGGAAACCGT 600 

CTGCGCGTTG CACTGACCGG TCTGACCATG GCTGAGAAGT TCCGTGACGA 650 

AGGTCGTGAC GTTCTGCTGT TCGTCGATAA CATCTACCGT TACACCCTGG 700 

55 CGGGTACTGA AGTATCTGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT 750 

TACCAGCCGA CCCTGGCGGA AGAGATGGGT GTTCTTCAGG AACGTATCAC 800 

CTCAACCAAA ACCGGTTCTA TCACCTCCG 82 9 



60 
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2) INFORMATION FOR SEQ ID NO: 3 92 



PCT/CA00/01150 



10 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia, bercovieri 

(B) STRAIN: ATCC 43970 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392 

CGAATTCCCC CAAGACGCTG TACCAAAAGT GTACAACGCC CTTGAGGTTG 50 

AAGGCACAGC TCAGAAGCTG GTGCTGGAAG TTCAGCAACA GCTGGGCGGT 10 0 

GGTGTTGTTC GTTGTATCGC AATGGGCTCT TCCGATGGTC TGAGCCGCGG 150 

2 0 GTTGAAAGTC ATCAACCTGG AACACCCAAT TGAAGTGCCG GTGGGTAAAT 2 00 

CAACTCTGGG CCGTATCATG AACGTATTGG GTGACCCAAT CGACATGAAA 2 50 

GGTCCTATCG GTGAAGAAGA GCGTTGGGCA ATCCACCGCG AAGCGCCTTC 3 00 

TTACGAAGAG CTTGCCAGCT CGCAAGATCT GTTAGAAACC GGTATCAAGG 350 

TAATGGATCT GATTTGTCCG TTCGCTAAGG GCGGTAAAGT CGGTCTGTTC 4 00 

25 GGTGGTGCGG GTGTGGGTAA AACAGTCAAC ATGATGGAGC TGATTCGTAA 450 

TATTGCGATT GAGCACTCAG GTTATTCTGT ATTTGCCGGT GTGGGTGAGC 50 0 

GTACTCGTGA GGGTAACGAC TTCTACCACG AGATGACTGA CTCCAACGTT 550 

CTGGACAAAG TATCCTTGGT TTATGGCCAG ATGAATGAGC CACCAGGTAA 60 0 

CCGTCTGCGC GTTGCACTGA CCGGCTTGAC CATGGCGGAG AAATTCCGTG 650 

3 0 ATGAAGGTCG TGATGTACTG TTATTCATCG ATAACATCTA TCGTTATACC 700 

CTGGCCGGTA CAGAGGTATC TGCACTGCTA GGTCGTATGC CATCAGCGGT 750 

AGGCTATCAG CCAACGCTGG CAGAAGAGAT GGGTGTGTTG CAGGAACGTA 8 00 

TCACTTCCAC CAAGACGGGT TCAATCACCT CCGTA 835 



35 



50 



2) INFORMATION FOR SEQ ID NO: 3 93 



{ i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393 



GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTTGAAGGCG CAGCTGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AACAGCTGGG CGGTGGTGTT GTTCGTTGTA 100 

55 TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTCATCAAC 150 

CTGGAACACC CAATTGAAGT GCCTGTGGGC AAGTCAACTC TGGGCCGTAT 2 00 

CATGAACGTA TTGGGTGACC CAATCGACAT GAAAGGTCCT ATCGGCGAAG 2 50 

AAGAGCGTTG GGCAATCCAT CGTGAAGCGC CTTCTTACGA AGATCTTGCC 3 00 

AGCTCGCAAG ACTTGTTAGA AACCGGTATC AAGGTAATGG ACTTGATTTG 3 50 

60 TCCGTTCGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 
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GTAAAACGGT AAACATGATG GAGCTTATTC GTAACATTGC GATTGAGCAC 4 50 

TCAGGTTATT CCGTATTTGC TGGCGTGGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAC CACGAGATGA CTGACTCCAA CGTTCTGGAC AAAGTATCCT 550 

TGGTTTATGG CCAAATGAAT GAGCCACCAG GTAACCGTCT GCGCGTTGCA 6 00 

5 CTGACCGGCT TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650 

ATTGCTGTTC ATCGATAACA TCTATCGCTA TACCTTAGCC GGTACGGAAG 700 

TTTCCGCACT GCTGGGTCGT ATGCCATCTG CCGTAGGTTA CCAGCCAACG 750 

CTGGCAGAAG AGATGGGTGT GTTGCAGGAA CGTATTACTT CCACCAAGAC 800 

GGGTTCAATC AC 812 

10 

2) INFORMATION FOR SEQ ID NO: 394 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia frederiksenii 
25 (B) STRAIN: ATCC 33641 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 94 

AAAGTGTACA ACGCCCTTGA GGTTGAAGGT ACTGCTGAGA AGTTAGTACT 50 

3 0 GGAAGTTCAG CAACAGCTGG GCGGTGGTGT TGCTCGTTGT ATCGCCATGG 100 

GCTCTTCCGA TGGTTTGAGC CGCGGGTTGA AAGTTGTCAA CCTGGAACAC 150 

CCAATTGAAG TACCGGTTGG TAAATCAACT CTGGGCCGTA TCATGAACGT 2 00 

ATTGGGTGAC CCAATCGACA TGAAAGGTCC TATCGGTGAA GAAGAGCGTT 250 

GGGCAATCCA CCGCGAAGCG CCTTCTTACG AAGAGCTTGC CAGCTCGCAA 3 00 

3 5 GATCTGTTAG AAACCGGTAT CAAGGTAATG GATCTGATTT GCCCGTTCGC 3 50 

TAAAGGCGGT AAAGTCGGTC TGTTCGGTGG TGCGGGTGTA GGTAAAACGG 400 

TAAACATGAT GGAGCTGATC CGTAATATCG CGATCGAGCA CTCAGGTTAT 450 

TCCGTATTTG CGGGTGTTGG TGAACGTACC CGTGAGGGTA ACGACTTCTA 500 

CCACGAGATG ACTGACTCCA ACGTTCTGGA CAAAGTATCC TTGGTTTATG 550 

4 0 GCCAGATGAA TGAGCCACCA GGTAACCGTC TTCGCGTTGC ACTGACCGGT 600 

CTGACCATGG CGGAGAAATT CCGTGATGAA GGTCGTGACG TATTGCTGTT 650 

CATCGATAAC ATCTATCGTT ATACCTTGGC CGGTACGGAA GTATCCGCAC 700 

TGCTGGGTCG TATGCCATCT GCGGTAGGCT ATCAGCCAAC GCTGGCAGAA 750 

GAGATGGGTG TGTTGCAGGA ACGTATTACT TCCACCAAGA CGGGTTCAAT 800 

45 CA 802 



2) INFORMATION FOR SEQ ID NO: 395 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Yersinia, intermedia 
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(B) STRAIN: ATCC 29909 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395 

5 GCTGTACCAA GAGTGTACAA CGCCCTTGAG GTTGAAGGCA CTGCTGAGAA 50 

GCTGGTGCTG GAAGTTCAGC AACAGCTAGG CGGTGGTGTT GTTCGTTGTA 100 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GCGGGTTGAA AGTCATCAAC 150 

CTGGAACACC CAATTGAAGT GCCGGTTGGT AAATCAACTC TGGGCCGTAT 2 00 

CATGAACGTA TTGGGTGACC CAATCGACAT GAAAGGTCCT ATCGGTGAAG 2 50 

10 AAGAGCGTTG GGCAATCCAC CGCGAAGCGC CTTCTTACGA AGAGCTTGCC 3 00 

AGCTCACAAG ATTTGTTAGA AACCGGTATC AAAGTAATGG ACTTGATTTG 3 50 

CCCGTTCGCT AAGGGCGGTA AAGTGGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACAGT AAACATGATG GAGCTTATTC GTAACATCGC GATTGAGCAC 450 

TCAGGTTATT CTGTATTTGC TGGTGTGGGT GAGCGTACTC GTGAGGGTAA 500 

15 CGACTTCTAC CACGAGATGA CTGACTCCAA CGTTCTGGAC AAAGTATCCT 550 

TGGTGTATGG CCAGATGAAT GAGCCACCAG GTAACCGTCT GCGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650 

ACTGTTGTTC ATCGATAACA TCTATCGCTA TACCTTGGCC GGTACGGAAG 700 

TATCCGCACT GCTGGGTCGT ATGCCATCAG CGGTAGGCTA CCAGCCAACG 750 

2 0 CTGGCAGAAG AGATGGGTGT GTTGCAGGAA CGTATTACGT CCACCAAGAC 800 

GGGTTC 8 06 



2 5 2) INFORMATION FOR SEQ ID NO: 3 96 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia, pseudotuberculosis 

(B) STRAIN: ATCC 29833 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396 



GCTGTACCAA AAGTGTACAA CGCCCTTGAG GTAGAAGGCA CAACTGAAAA 50 

GTTAGTGCTG GAAGTTCAGC AACAGTTGGG CGGTGGTGTT GTTCGTTGTA 100 

TCGCAATGGG CTCTTCCGAT GGTCTGAGCC GTGGGTTGAA AGTAACCAAC 150 

CTGGAACACC CGATCGAAGT ACCGGTTGGT AAAGCGACCC TTGGCCGTAT 200 

4 5 CATGAACGTA TTGGGTGAAC CAATCGACAT GAAAGGTCCT ATCGGTGAAG 2 50 

AAGAGCGTTG GGCAATCCAT CGCGAAGCGC CTTCTTATGA AGAGCTTGCT 3 00 

AGCTCACAAG ATCTGTTAGA AACCGGTATC AAGGTTATGG ACCTGATTTG 3 50 

TCCGTTTGCT AAGGGCGGTA AAGTCGGTCT GTTCGGTGGT GCGGGTGTAG 4 00 

GTAAAACAGT AAACATGATG GAGCTGATCC GTAACATCGC GATCGAGCAC 4 50 

50 TCTGGGTATT CTGTATTTGC CGGTGTAGGT GAGCGTACCC GTGAGGGTAA 500 

TGACTTCTAC CATGAAATGA CTGACTCCAA CGTTTTGGAC AAAGTATCCT 550 

TGGTTTACGG CCAGATGAAT GAGCCACCAG GTAACCGTCT ACGCGTTGCA 600 

CTGACCGGCC TGACCATGGC GGAGAAATTC CGTGATGAAG GTCGTGACGT 650 

ACTGCTGTTC ATCGATAATA TCTATCGTTA TACCCTAGCT GGTACGGAAG 700 

55 TATCCGCATT GCTGGGTCGT ATGCCATCAG CGGTAGGTTA TCAGCCAACA 750 

CTGGCTGAAG AGATGGGTGT GTTGCAGGAA CGTATTACTT CCACTAAGAC 800 

GGGTTC 806 



60 
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397 



10 



15 



20 



25 



30 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia rohdei 

(B) STRAIN: ATCC 43380 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3 97 



TTCCCCCAAG 
TGCAGCTGAG 
TTGTTCGTTG 
AAAGTTATCA 
TCTGGGCCGT 
CTATCGGTGA 
GAAGAGCTTG 
GGATCTGATT 
GTGCGGGTGT 
GCGATTGAGC 
TCGTGAGGGT 
ACAAAGTATC 
CTGCGCGTTG 
AGGCCGTGAC 
CCGGTACGGA 
TATCAGCCAA 
TTCCACTAAG 



ACGCTGTACC 
AAGCTTGTGC 
TATCGCAATG 
ACCTGGAACA 
ATCATGAACG 
AGAAGAGCGT 
CCAGCTCGCA 
TGTCCGTTCG 
TGGTAAAACA 
ACTCAGGTTA 
AACGACTTCT 
CTTGGTTTAT 
CACTGACCGG 
GTATTGCTGT 
AGTATCCGCA 
CACTGGCAGA 
ACGGGTTCAA 



AAAAGTGTAC 
TGGAAGTTCA 
GGCTCTTCCG 
CCCAATTGAA 
TATTGGGTGA 
TGGGCAATCC 
AGATCTGTTA 
CTAAGGGCGG 
GTAAACATGA 
TTCTGTATTT 
ACCACGAGAT 
GGCCAGATGA 
CTTGACCATG 
TCATCGATAA 
CTGCTGGGTC 
AGAGATGGGT 
TCACCTCCG 



AACGCCCTTG 
GCAGCAGCTG 
ATGGTTTGAG 
GTGCCAGTTG 
CCCAATCGAC 
ACCGTGAAGC 
GAAACCGGTA 
TAAAGTCGGT 
TGGAGCTTAT 
GCCGGGGTAG 
GACTGACTCC 
ATGAGCCACC 
GCGGAAAAAT 
CATTTATCGT 
GTATGCCATC 
GTGTTGCAGG 



AGGTTGAAGG 50 

GGCGGTGGTG 100 

CCGTGGGTTG 150 

GTAAATCAAC 2 00 

ATGAAAGGCC 2 50 

GCCTTCTTAC 3 00 

TCAAGGTAAT 3 50 

CTGTTCGGTG 4 00 

TCGTAACATC 4 50 

GTGAACGTAC 500 

AACGTTCTGG 550 

AGGTAACCGT 600 

TCCGTGATGA 650 

TATACCCTAG 70 0 

TGCGGTAGGC 750 

AACGTATTAC 800 

829 



35 



40 



45 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 398 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yokenella regensburgei 

(B) STRAIN: ATCC 35313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398 

ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTACAAAA 
AAACTGGTGC TGGAAGCTCA GCAGCAGCTC GGCGGCGGTA 
TATCGCCATG GGTTCTTCCG ACGGTCTGCG TCGTGGTCTG 
ACCTCGAGCA CCCGATCGAA GTCCCGGTAG GTAAAGCAAC 
ATCATGAACG TCCTGGGTCA GCCGATCGAC ATGAAAGGCG 
AGAAGAGCGT TGGGCTATCC ACCGCGCAGC ACCTTCCTAT 
CCAGCTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTAAT 
TGCCCGTTCG CTAAGGGTGG TAAAGTCGGT CTGTTCGGTG 

238 



TGGTAACGAG 50 

TCGTGCGTAC 100 

GAAGTTAAAG 150 

CCTGGGCCGT 200 

ACATCGGTGA 250 

GAAGAGCTGT 3 00 

GGATCTGATC 350 

GTGCGGGTGT 4 00 
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AGGTAAAACT GTAAACATGA TGGAGCTTAT 
ACTCCGGTTA CTCTGTGTTT GCAGGCGTGG 
AACGACTTCT ACCACGAAAT GACCGACTCC 
CCTGGTGTAC GGCCAGATGA ACGAGCCGCC 
5 CGCTGACCGG CCTGACCATG GCTGAGAAAT 
GTTCTGCTGT TCGTCGATAA CATCTACCGT 
AGTATCCGCA CTGCTGGGTC GTATGCCTTC 
CTCTGGCGGA AGAGATGGGT GTTCTTCAGG 
ACCGGTTCTA TCACCTCCG 



PCT/CA00/01150 

CCGTAACATC GCGATTGAGC ' 4^0' 

GTGAACGTAC TCGTGAGGGT 500 

AACGTACTGG ATAAAGTATC 550 

GGGAAACCGT CTGCGCGTTG 600 

TCCGTGACGA AGGCCGTGAC 650 

TATACCCTGG CCGGTACGGA 7 00 

TGCGGTAGGT TATCAGCCAA 750 

AACGTATCAC CTCTACCAAA 800 

819 



2) INFORMATION FOR SEQ ID NO: 399 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yarrowia lipolytica 
25 (B) STRAIN: ATCC 38295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399 

AAGC TTAAGG CTGAGCGAGA GCGAGGTATC ACCATTGATA TCGCTCTCTG 50 

3 0 GAAGTTCCAG ACCCCTAAGT ACTACGTCAC CGTTATTGAT GCTCCCGGTC 100 

ACCGAGATTT CATCAAGAAC ATGATTACCG GTACTTCCCA GGCTGACTGC 150 

GCCATCCTCA TCATTGCTGG TGGTGTTGGT GAGTTCGAGG CTGGTATCTC 200 

CAAGGATGGT CAGACCCGAG AGCACGCCCT GCTCGCTTTC ACCCTCGGTG 2 50 

TTAAGCAGCT CATTGTTGCT ATCAACAAGA TGGACTCCGT CAAGTGGTCT 3 00 

3 5 CAGGATCGAT ACCTCGAGAT TTGCAAGGAG ACTGCCAACT TCGTCAAGAA 3 50 

GGTCGGTTAC AACCCCAAGG CTGTCCCCTT CGTCCCCATT TCCGGATGGA 400 

ACGGTGACAA CATGATCGAG CCCTCTACCA ACTGTGACTG GTACAAGGGA 450 

TGGACCAAGG AGACCAAGGC CGGCGAGATC AAGGGTAAGA CCCTCCTCGA 50 0 

GGCCATTGAT GCCATTGAGC CCCCCGTGCG ACCCCACGAC AAGCCCCTCC 550 

4 0 GACTTCCCCT CCAGGATGTC TACAAGATCG GTGGTATCGG CACAGTGCCC 600 

GTTGGCCGAG TCGAGACCGG TGTTATCAAG GCCGGTATGG KTGTTACCTT 650 

CGCTCCCGCC AACGTGACCA CTGAGGTCAA GTCTGTCGAG ATGCACCACG 700 

AGATCCTCCC CGATGGAGGT TTCCCCGGTG ACAACGTCGG TTTCAACGTC 750 

AAGAACGTTT CCGTCAAGGA TATCCGACGA GGTAACGTTG CTGGTGACTC 800 

4 5 CAAGAACGAC CCCCCCAAGG GCTGCGACTC TTTCAACGCT CAGGTCATTG 850 

TTCTTAACCA CCCCGGTCAG ATCGGTGCTG GTTACGCTCC CGTCCTTGAT 900 

TGCCACACTG CCCACATTGC TTGCAAGTTC GACACCCTGA TCGAGAAGAT 950 

CGAC CGACGA AC CGGTAAGA AGATGGAGGA CTCCCCCAAG TTCATCAAGT 1000 

CTGGTGATGC TGCCATTGTC AAGATGGTTC CCTCCAAGCC CATGTGTGTC " 1050 

50 GAGGCCTTCA CTGAGTACCC CCCTCTTGGT CGATTCGCCG TCCGAGA 1097 



2) INFORMATION FOR SEQ ID NO: 400 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1233 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Absidia corymbifera 

(B) STRAIN: ATCC 46775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400 



10 



15 



20 



25 



30 



35 



CAAGCTTAAG 
GGAAGTTCGA 
CATCGTGATT 
CGGTATCTTG 
CCAAGGATGG 
GTCCGTCAAT 
TGAGGCCCGT 
AGATTGGTTT 
AACGGTGACA 
ATGGAACAAG 
AAGCCATTGA 
CGTCTTCCCC 
TGTCGGTCGT 
TCGCTCCCGC 
GAGCAACTTG 
GAACGTTTCC 
AGAACGACCC 
TTGAACCACC 
CCACACTGCT 
ATYGTCGTTC 
TTAATCTYTG 
ACTTTTTGAT 
GTAAGAAGTT 
ATCGTCAAGA 
ATATCCTCCT 



GCTGAACGTG 
GACTCCCAAG 
TCATCAAGAA 
ATTATTGCTG 
TCAAACCCGT 
TGATTGTCGC 
TACAACGAAA 
CAACCCCAAG 
ACATGTTGGA 
GAGACTAAGG 
CAACATTGAT 
TTCAAGATGT 
GTTGAGACTG 
TAACGTCACC 
CTGAAGGTGT 
GTCAAGGATA 
CGCCAAGGAA 
CTGGTCARAT 
CACATTGCAT 
CGGTAAGTAA 
GATTTTGCCT 
CATTTGCTAA 
GGAAGACTCC 
TGGTTCCTTC 
CTTGGTCGTT 



AGCGTGGTAT 
TACCACGTTA 
CATGATTACT 
CTGGTACTGG 
GAACACGCTT 
TATCAACAAG 
TTGTCAAGGA 
TCCGTTCCTT 
GGARTCCACC 
CTGGTGCCAA 
CCCCCTGTTC 
CTACAAGATC 
GTGTCATCAA 
ACTGAAGTCA 
TCCCGGTGAC 
TCCGCCGTGG 
TCCGCTTCCT 
TGGTGCTGGT 
GCAAGTTCTY 
ATANTTTGGT 
CAATTGCTCC 
TCCAAACCCT 
CCCAAGTTCG 
CAAGCCCATG 
TCGCTGTCCG 



CACCATCGAT 
CCGTCATTGA 
GGTACTTCCC 
TGAATTCGAA 
TGCTTGCTTT 
ATGGATTCCA 
AGTCTCCACC 
TCGTCCCTAT 
AACATGCCTT 
GACYGGCAAG 
GTCCTTCCGA 
GGTGGTATTG 
GCCTGGTATG 
AGTCCGTYGA 
AACGTCGGTT 
TAACGTYTGC 
TCACCGCTCA 
TACTCTCCTG 
TKAGCTTCTT 
TTRGGATATG 
TTCCTTGATC 
TTCCATTTYA 
TCAAGWSYGG 
TGCGTTGAAG 
TGA 



ATCGCTCTCT 50 

TGCCCCTGGC 100 

AAGCTGACTG 150 

GCTGGTATCT 2 00 

CACCCTTGGT 2 50 

CCAAGTACTC 3 00 

TTCATCAAGA 3 50 

CTCTGGCTGG 400 

GGTTCAAGGG 4 50 

ACCCTTCTTG 500 

CAAGCCCCTT 550 

GTACAGTTCC 600 

GTTGTCACCT 650 

AATGCAC CAC 700 

TCAACGTCAA 750 

TCTGACTCCA 8 00 

AGTTATTGTC 850 

TCTTGGATTG 900 

KAGAAGATCG 950 

GGTATTGGGC 100 0 

TTTCTCGATT 1050 

TTGAAAACAG 1100 

TGACTCTGCY 1150 

CCTACACTGA 12 0 0 
1233 



2) INFORMATION FOR SEQ ID NO: 401 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alternaria alternata 
50 (B) STRAIN: ATCC 62099 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401 

CAAGTTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTCT 50 

55 GGAAGTTCGA GACTCCCAAG GTTAGTACCC CTCTGCCTAC TACATCAAGT 10 0 

TCTTTACAAT GCTAACATGT TGTACTCAGT ACTATGTCAC CGTCATTGAC 150 

GCCCCCGGTC ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA 2 00 

GGCCGACTGC GCTATTCTCA TCATTGCCGC CGGTACTGGT GAGTTCGAGG 250 

CTGGTATCTC CAAGGATGGC CAGACTCGTG AGCACGCTCT CCTCGCTTAC 3 00 

60 ACCCTCGGTG TCAAGCAGCT CATCGTTGCC ATCAACAAGA TGGACACCAC 3 50 

240 
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CAAGTGGTCC GAGGAGCGTT 
TCATCAAGAA GGTCGGCTAC 
TCCGGTTTCA ACGGTGACAA 
GTACAAGGGT TGGGAGAAGG 
5 TCCTCGAGGC CATCGACGCC 
CCCCTCCGTC TTCCCCTYCA 
GGTGCCCGTC GGTCGTGTCG 
TCACCTTCGC CCCCGCTGGT 
CACCACGAGC AGCTCACCGA 

10 CGTCAAGAAC GTCTCCGTCA 
ACTCCAAGAA CGACCCCCCC 
ATCGTCCTCA ACCACCCTGG 
CGACTGCCAC ACCGCCCACA 
AGATTGACCG CCGTACCGGA 

15 AAGTCCGGTG ACGCCGCCAT 
CGTTGAGGCT TTCACTGACT 
A 



ACCAGGAGAT CATCAAGGAG 
AACCCCAAGC ACGTTCCCTT 
CATGATTGAG GCCTCATCCA 
AGACCAAGGC CAAGGCCACT 
ATCGACCCTY CCAGCCGTCC 
GGATGTTTAC AAGATTGGTG 
AGACCGGTAT CATCAAGGCC 
GTCACCACTG AAGTCAAGTC 
GGGTGTCCCC GGTGACAACG 
AGGAGATCCG TCGTGGTAAC 
AAGGGTGCCG AGTCCTTCAA 
TCAGGTCGGT GCTGGTTACG 
TTGCTTGCAA GTTCTCTGAG 
AAGTCTGTTG AGAACTCTCC 
CGTCAAGATG GTTCCCTCCA 
ACCCTCCTCT CGGTCGTTTC 



ACCTCCAACT' 


4~00 


CGTCCCCATC 


450 


ACTGCCCCTG 


500 


GGTAAGACCC 


550 


CACCGACAAG 


600 


GTATTGGCAC 


650 


GGTATGGTCG 


700 


CGTCGAGATG 


750 


TCGGCTTCAA 


800 


GTTGCCGGTG 


850 


CGCCCAGGTC 


900 


CCCCAGTCCT 


950 


CTCCTCGAGA 


1000 


CAAGTTCATC 


1050 


AGCCCATGTG 


1100 


GCTGTCCGTG 


1150 




1151 



20 



2) INFORMATION FOR SEQ ID NO: 402 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 12 83 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 2 6947 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402 

CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50 

GGAAGTTCCA GACCTCCAAG TATGAGGTCA CCGTCATTGG TAAGCATTTG 10 0 

AGTTCCAACC TACGTTGCCC AACATTTACA GTCATCTAAC AAAGTTCAAT 150 

4 0 AGATGCCCCC GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACTT 200 

CCCAGGCTGA CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAATTC 2 50 

GAGGCTGGTA TCTCCAAGGA TGGTCAGACC CGTGAGCACG CTCTGCTCGC 3 00 

TTTCACCCTC GGTGTCCGTC AGCTCATCGT TGCCCTCAAC AAGATGGACA 3 50 

CCTGCAAGTG GTCTCAGGAT CGTTACAACG AAATCGTTAA GGAGACTTCC 4 00 

45 AACTTCATCA AGAAGGTCGG ATACAACCCC AAGAGCGTTC CTTTCGTCCC 4 50 

CATCTCCGGT TTCAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC 5 00 

CCTGGTACAA GGGTTGGGAG AAGGAGACCA AGGCTGGCAA GTCCACCGGT 5 50 

AAGACCCTTC TCGAGGCCAT CGATGCCATC GAGCCCCCCG TCCGTCCCAC 600 

CGACAAGCCT CTCCGTCTTC CCCTYCAGGA TGTCTACAAG ATCTCTGGTA 650 

50 TCGGTACTGT GCCCGTCGGT CGTGTCGAGA CTGGTGTCAT CAAGCCTGGT 700 

ATGGTCGTTA CTTTCGCTCC TGCCAACGTG ACCACTGAAG TCAAGTCCGT 750 

TGAAATGCAC CACCAGCAGC TCCAGGCCGG TAACCCCGGT GACAACGTTG 8 00 

GTTTCAACGT CAAGAACGTC TCCGTCAAGG AAGTCCGCCG TGGTAACGTT 850 

GCCGGTGACT CCAAGAACGA CCCCCCTGCT GGCTGCGATT CCTTCAACGC 900 

55 CCAGGTCATC GTCCTTAACC ACCCCGGTCA GGTCGGCAAC GGTTACGCTC 950 

CCGTCCTGGA CTGCCACACC GCTCACATTG CTTGCAAGTT CGCTGAGCTC 1000 

CTTGAGAAGA TTGACCGCCG TACCGGTAAA TCTGTTGAGG ACAAGCCCAA 1050 

GTTCATCAAG TCTGGTGATG CTGCCATCGT CAAGATGATT CCCTCCAAGC 1100 

CCATGTGTGT GGAGTCTTTC ACTGACTTCC CCCCTCTTGG TCGTTTCGCT 1150 

60 GTCCGTGACG TAAGTTTTTC CCTCTTGACT ATCTTCACAA TTTTTCACAT 12 00 

241 
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15 



55 
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ATTTTCACGC CTCGTCCCAC TCTTTTTCCT CCCTTCCTCT TTGGTTCCCC 12 50 

TTTTTGCCTG CAAGTTCTCT ATAGCTAACA TGA 12 83 



2) INFORMATION FOR SEQ ID NO: 403 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1103 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: DAL95 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403 

TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT GGAAGTTCCA 50 

GACTCCCAAG TATGAGGTCA CTGTCATCGG TAAGCTCGAC TCGCCCCGAT 100 

ATGTTTTGGT GCTGTAGCTA ACACGATCTG AAGATGCCCC CGGTCACCGT 150 

25 GACTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCTG ACTGCGCTAT 2 00 

CCTCATCATT GCCTCCGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG 2 50 

ATGGCCAGAC CCGTGAGCAC GCTCTGCTGG CTTTCACCCT CGGTGTCAAG 3 00 

CAGCTCATCG TCGCCCTCAA CAAGATGGAC ACCTGCAAGT GGTCCGAGGA 3 50 

TCGTTACAAC GAAATTGTCA AGGAAACCTC CAACTTCATC AAGAAGGTCG 4 00 

3 0 GCTACAACCC CAAGGCCGTT CCCTTCGTCC CCATCTCTGG CTTCAACGGT 450 

GACAACATGC TTGAGCCCTC CTCCAACTGC CCCTGGTACA AGGGATGGGA 500 

GAAGGAGACC AAGGCCGGCA AGGTCACTGG TAAGACCCTC ATCGAGGCCA 550 

TCGACGCCAT TGAGCCCCCT GTCCGTCCCT CCAACAAGCC CCTCCGTCTT 600 

CCCCTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACGG TCCCTGTCGG 650 

3 5 CCGTGTCGAG ACCGGTATCA TCAAGCCCGG CATGGTCGTC ACCTTCGCCC 700 

CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACCAGCAG 750 

CTC CAGGAGG GTGTCCCCGG TGACAACGTC GGTTTCAACG TCAAGAACGT 800 

TTCCGTCAAG GAAGTCCGCC GTGGTAACGT CTGCGGTGAC TCCAAGAACG 850 

ATCCCCCTCA GGGTGCTGCC TCCTTCAACG CCCAGGTCAT CGTCCTCAAC 900 

40 CACCCCGGTC AGGTCGGCGC TGGTTACGCC CCCGTCCTCG ACTGCCACAC 950 

TGCCCACATT GCTTGCAAGT TCTCTGAGCT GCTTGAGAAG ATTGACCGCC 10 00 

GTACCGGCAA GTCTGTTGAG AACAACCCCA AGTTCATCAA GTCCGGTGAT 1050 

GCCGCCATCG TGAAGATGGT TCCTTCCAAG CCCATGTGTG TCGAGTCCTT 1100 

CAC 1103 

45 

2) INFORMATION FOR SEQ ID NO: 4 04 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
60 (B) STRAIN: WSA-172 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404 

AAGCTCAAGT CCGAGCGTGA GCGTGGTATC ACCATCGACA TTGCCCTCTG 50 

5 GAAGTTCCAG ACTCCCAAGT ATGAGGTCAC TGTCATCGGT AAGCTCGACT 100 

CGCCCCGATA TGTTTTGGTG CTGTAGCTAA CACGATCTGA AGATGCCCCC 150 

GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 2 00 

CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA 2 50 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTGCTGGC TTTCACCCTC 3 00 

10 GGTGTCAAGC AGCTCATCGT CGCCCTCAAC AAGATGGACA CCTGCAAGTG 3 50 

GTCCGAGGAT CGTTACAACG AAATTGTCAA GGAAACCTCC AACTTCATCA 4 00 

AGAAGGTCGG CTACAACCCC AAGGCCGTTC CCTTCGTCCC CATCTCTGGC 4 50 

TTCAACGGTG ACAACATGCT TGAGCCCTCC TCCAACTGCC CCTGGTACAA 500 

GGGATGGGAG AAGGAGACCA AGGCCGGCAA GGTCACTGGT AAGACCCTCA 550 

15 TCGAGGCCAT CGACGCCATT GAGCCCCCTG TCCGTCCCTC CAACAAGCCC 600 

CTCCGTCTTC CCCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGAACGGT 650 

CCCTGTCGGC CGTGTCGAGA CCGGTATCAT CAAGCCCGGC ATGGTCGTCA 700 

CCTTCGCCCC CGCCAACGTC AC CACTGAAG TCAAGTCCGT CGAGATGCAC 750 

CACCAGCAGC TCCAGGAGGG TGTCCCCGGT GACAACGTCG GTTTCAACGT 8 00 

2 0 CAAGAACGTT TCCGTCAAGG AAGTCCGCCG TGGTAACGTC TGCGGTGACT 8 50 

CCAAGAACGA TCCCCCTCAG GGTGCTGCCT CCTTCAACGC CCAGGTCATC 900 

GTCCTCAACC ACCCCGGTCA GGTCGGCGCT GGTTACGCCC CCGTCCTCGA 950 

CTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTG CTTGAGAAGA 10 00 

TTGACCGCCG TACCGGCAAG TCTGTTGAGA ACAACCCCAA GTTCATCAAG 1050 

2 5 TCCGGTGATG CCGCCATCGT GAAGATGGTT CCTTCCAAGC CCATGTGTGT 1100 

CGAGTCCTTC ACTGACTACC CCCCTCTGGG TCGTTTCGCC GTCCGTGAC 114 9 



30 2) INFORMATION FOR SEQ ID NO: 405 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405 



CAAGCTCAAG TCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 50 

GGAAGTTCCA GACTGGCAAG TATGAGGTCA CCGTCATTGG TATGTACTCA 100 

CAGAGTTCTC TTTTCATCAA AGCAATATAC TAACGTCCAT CATAGACGCC 150 

CCCGGTCACC GTGACTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC 2 00 

50 TGACTGCGCT ATCCTCATCA TTGCCTCCGG TACTGGTGAG TTCGAGGCTG 250 

GTATCTCCAA GGATGGCCAG ACTCGTGAGC ACGCTCTGCT TGCTTTCACC 300 

CTCGGTGTCC GCCAGCTCAT CGTTGCCCTC AACAAGATGG ACACCTGCAA 350 

GTGGTCCGAG GACCGTTACA ACGAAATCGT TAAGGAGACC TCCAACTTCA 4 00 

TCAAGAAGGT CGGATACAAC CCCAAGGGTG TTCCTTTCGT CCCCATCTCC 450 

55 GGTTTCAACG GTGACAACAT GCTCGAGCCC TCCCCCAACT GCCCCTGGTA 500 

CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG CAAGGTCACC GGTAAGACCC 550 

TCCTTGAGGC CATCGACGCC ATCGAGCCCC CCGTCCGTCC CTCCAACAAG 600 

CCCCTCCGTC TTCCCCTCCA GGATGTCTAC AAGATCTCCG GTATTGGAAC 650 

TGTTCCCGTC GGTCGTGTCG AGACCGGTAT CATTGCCCCT GGTATGGTCG 700 

60 TGACCTTCGC TCCCGCCAAC GTCACCACTG AAGTCAAGTC CGTTGAGATG 750 

243 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CACCACCAGC AGCTCAAGGA AGGTGTCCCC 
CGTCAAGAAC GTTTCCGTCA AGGAGGTTCG 
ACTCCAAGAA CGACCCCCCT CTTGGCTGTG 
ATCGTCCTCA ACCACCCCGG TCAGGTCGGC 
5 GGACTGCCAC ACTGCTCACA TTGCTTGCAA 
AGATTGACCG CCGTACCGGA AAGTCTGTTG 
AAGTCCGGTG ACGCTGCCAT CGTCAAGATG 
TGTTGAGGCT TTCACTGACT ACCCCCCTCT 
A 



PCT/CA00/01150 

GGTGACAACG TTGGTTTCAA 800 

CCGTGGTAAC GTTGCCGGTG 850 

AGAGCTTCAC CGCCCAGGTC 90 0 

GCTGGTTACG CTCCCGTCCT 950 

GTTCGCTGAG CTCCTTGAGA 1000 

AATCTTCCCC CAAGTTCATC 1050 

ATTCCCTCCA AGCCCATGTG 1100 

TGGTCGTTTC GCCGTCCGCG 1150 

1151 



2) INFORMATION FOR SEQ ID NO: 4 06 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 
25 (B) STRAIN: ATCC 10663 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406 

GCTTAAAGCT GAACGTGAAC GTGGTATCAC CATTGATATC GCTCTCTGGA 50 

3 0 AGTTCGAAAC TCCTAAGTAC TACGTTACTG TTATTGATGC TCCAGGTCAC 100 

CGTGATTTCA TCAAGAACAT GATTACTGGT ACTTCCCAAG CCGATTGCGC 15 0 

CATTCTTATC ATTGCTGCCG GTGTCGGTGA ATTCGAAGCT GGTATCTCCA 20 0 

AGGAAGGTCA AACCAGAGAA CACGCTCTTC TCGCTTTCAC CCTTGGTGTC 2 50 

AGACAACTTA TCATTGCCAT CAACAAGATG GACTCTGTCA AGTGGGACCA 300 

35 AAAGAGATAC GAAGAAATCG TCAAGGAGGC TTCCAACTTC GTCAAGAAGG 350 

TTGGTTACAA CCCCAAGTCT GTTCCATTCG TTCCTATCTC TGGTTGGAAC 4 00 

GGTGACAACA TGTTGGAACC TACCACCAAC GCCCCATGGT ACAAGGGATG 4 50 

GACCAAGGAA ACCAAGGCTG GTGCCACTAA GGGTATGACT CTTATTGAAG 500 

CCATTGACGC CATTGAACCA CCAGTAAGAC CATCCGACAA GCCACTCCGT 550 

4 0 CTCCCACTCC AAGATGTTTA CAAGATTGGT GGTATCGGAA CTGTGCCAGT 600 

CGGCCGTGTC GAAACCGGTA TCATCAAGGC CGGTATGGTC GTTACCTTTG 650 

CTCCACCAAT GGTCACAACT GAAGTTAAGT CCGTTGAAAT GCACCACGAA 700 

CAACTTGCTC AAGGTAACCC AGGTGACAAC GTTGGTTTCA ACGTCAAGAA 750 

CGTTTCCGTT AAGGAAATCA GACGTGGTAA CGTCTGTGGT GACTCCAAGA 800 

45 ACGATCCACC AAAGGGCTGC GAATCTTTCA ACGCTCAAGT TATCGTCTTG 8 50 

AACCACCCTG GTCAAATCTC TGCTGGTTAC TCTCCAGTTC TCGATTGCCA 900 

CACTGCCCAC ATTGCCTGCA GATTCGACGA ACTCCTTGAA AAGATCGACC 950 

GTCGTTCTGG TAAGAAGATT GAAGACTCTC CAAAGTTTGT CAAGTCTGGT 1000 

GATGCCGCTA TCGTCAAGAT GATCCCAACC AAGC CAATGT GCGTTGAAAC 1050 

50 CTTCACTGAA TACCCACCAC TTGGTCGTTT CGCCGTCCGT GAT 1093 



55 



2) INFORMATION FOR SEQ ID NO: 407 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, albicans 

(B) STRAIN: ATCC 10231 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407 

CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 10 0 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 2 50 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTYGT TCCAATCTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 4 50 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 5 00 

TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 6 00 

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 6 50 

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 70 0 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 8 00 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 90 0 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 95 0 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 10 0 0 

AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 105 0 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

A - 1101 



2) INFORMATION FOR SEQ ID NO: 408 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1089 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408 

GAAGGCTGAA AGAGAAAGAG GTATCACCAT TGATATCGCT TTGTGGAAAT 50 

TCGAAACTCC AAAATACCAC GTTACCGTCA TTGATGCTCC AGGTCACAGA 100 

GATTTCATCA AGAATATGAT CACTGGTACT TCTCAAGCTG ATTGTGCTAT 150 

TTTGATTATT GCTGGTGGTA CTGGTGAATT CGAAGCCGGT ATTTCTAAGG 2 00 

ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACTTT GGGTGTCAAA 2 50 

CAATTGATTG TTGCTGTCAA CAAGATGGAC TCTGTCAAAT GGGACAAAAA 3 00 

CAGATTTGAA GAAATCATCA AGGAAACCTC CAACTTCGTC AAGAAGGTTG 350 

GTTACAACCC AAAGACTGTT CCATTCGTTC CAATCTCTGG TTGGAATGGT 4 00 

GACAACATGA TTGAACCATC CACCAACTGT CCATGGTACA AGGGTTGGGA 4 50 
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AAAGGAAACC 
TTGACGCTAT 
CCATTGCAAG 
TAGAGTTGAA 
CAGCTGGTGT 
TTGGCTGAAG 
TTCCGTTAAA 
ATCCACCAAA 
CATCCAGGTC 
TGCCCACATT 
GAACTGGTAA 
GCTGCTATCG 
CACTGACTAC 



AAATC CGGTA 
TGAACCACCA 
ATGTTTACAA 
ACTGGTATCA 
TACCACTGAA 
GTGTTCCAGG 
GAAATTAGAA 
GGGTTGTGAC 
AAATCTCTGC 
GCTTGTAAAT 
GAAATTGGAA 
TCAAGATGGT 
CCACCATTAG 



AAGTTACTGG 
ACCAGACCAA 
GATCGGTGGT 
TCAAAGCCGG 
GTCAAATCCG 
TGACAATGTT 
GAGGTAACGT 
TCTTTCAATG 
TGGTTACTCT 
TCGACACTTT 
GAAAATCCAA 
CCCAACCAAA 
GTAGATTCGC 



TAAGACCTTG 
CCGACAAACC 
ATTGGTACTG 
TATGGTTGTT 
TTGAAATGCA 
GGTTTCAACG 
TTGTGGTGAC 
CCCAAGTCAT 
CCAGTCTTGG 
GGTTGAAAAG 
AATTCGTCAA 
CCAATGTGTG 
TGTCAGAGA 



PCT/CA00/01150 

TTAGAAGCTA 500 

ATTGAGATTG 550 

TGCCAGTCGG 600 

ACTTTCGCCC 650 

TCACGAACAA 700 

TTAAGAACGT 750 

TCCAAGAACG 8 00 

TGTTTTGAAC 850 

ATTGTCACAC 900 

ATTGACAGAA 950 

ATCCGGTGAT 1000 

TTGAAGCTTT 1050 
1089 



15 



20 



25 



30 



2) INFORMATION FOR SEQ ID NO: 409 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, albicans 

(B) STRAIN: ATCC 56884 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409 



CTTGGACAAA 
CTTTGTGGAA 

35 CCAGGTCACA 
TGATTGTGCT 
GTATTTCTAA 
TTGGGTGTCA 
ATGGGACAAA 

40 TCAAGAAGGT 
GGTTGGAATG 
CAAGGGTTGG 
TGTTAGAAGC 
CCATTGAGAT 

45 TGTGCCAGTC 
TTACTTTCGC 
CATCACGAAC 
CGTTAAGAAC 
ACTCCAAGAA 

50 ATTGTTTTGA 
GGATTGTCAC 
AGATTGACAG 
AAATCCGGTG 
TGTTGAAGCT 

55 A 



TTGAAGGCTG 
ATTCGAAACT 
GAGATTTCAT 
ATTTTGATTA 
GGATGGTCAA 
AACAATTGAT 
AACAGATTTG 
TGGTTACAAC 
GTGACAACAT 
GAAAAGGAAA 
TATTGACGCT 
TGCCATTGCA 
GGTAGAGTTG 
CCCAGCTGGT 
AATTGGCTGA 
GTTTCCGTTA 
CGATCCACCA 
ACCATCCAGG 
ACTGCCCACA 
AAGAACTGGT 
ATGCTGCTAT 
TTCACTGACT 



AAAGAGAAAG 
CCAAAATACC 
CAAGAATATG 
TTGCTGGTGG 
ACCAGAGAAC 
TGTTGCTGTC 
AAGAAATCAT 
CCAAAGACTG 
GATTGAACCA 
CCAAATCCGG 
ATTGAACCAC 
AGATGTTTAC 
AAACTGGTAT 
GTTACCACTG 
AGGTGTTCCA 
AAGAAATTAG 
AAGGGTTGTG 
TCAAATCTCT 
TTGCTTGTAA 
AAGAAATTGG 
CGTCAAGATG 
ACCCACCATT 



AGGTATCACC 
ACGTTACCGT 
ATCACTGGTA 
TACTGGTGAA 
ACGCTTTGTT 
AACAAGATGG 
CAAGGAAACC 
TTCCATTCGT 
TCCACCAACT 
TAAAGTTACT 
CAACCAGACC 
AAGATCGGTG 
CATCAAAGCC 
AAGTCAAATC 
GGTGACAATG 
AAGAGGTAAC 
ACTCTTTCAA 
GCTGGTTACT 
ATTCGACACT 
AAGAAAATCC 
GTCCCAACCA 
AGGTAGATTC 



ATTGATATCG 50 

CATTGATGCT 100 

CTTCTCAAGC 150 

TTCGAAGCCG 2 00 

GGCTTACACT 2 50 

ACTCTGTCAA 3 00 

TCCAACTTCG 350 

TCCAATCTCT 400 

GTCCATGGTA 450 

GGTAAGACCT 500 

AACCGACAAA 550 

GTATTGGTAC 600 

GGTATGGTTG 650 

CGTTGAAATG 700 

TTGGTTTCAA 750 

GTTTGTGGTG 800 

TGCCCAAGTC 850 

CTCCAGTCTT 90 0 

TTGGTTGAAA 950 

AAAATTCGTC 1000 

AACCAATGTG 1050 

GCTGTCAGAG 1100 
1101 



60 



2 ) INFORMATION FOR SEQ ID NO: 410 
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( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 60193 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410 

CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 2 50 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

ATGGGACAAA AACAGATTTG AAGAAATCAT CAAGGAAACC TCCAACTTCG 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAACCA TCCACCAACT GTCCATGGTA 4 50 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAAGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGACGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTAT CATCAAAGCC GGTATGGTTG 650 

TTACTTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGCTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAC GTTTCCGTTA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTTTTGA ACCATCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTCTT 90 0 

GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ATGCTGCTAT CGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 1100 

AT H02 



2) INFORMATION FOR SEQ ID NO: 411 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 90028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 411 

CTTGGACAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 5 0 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAATATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 2 00 

247 
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GTATTTCTAA 
TTGGGTGTCA 
ATGGGACAAA 
TCAAGAAGGT 
GGTTGGAATG 
CAAGGGTTGG 
TGTTAGAAGC 
CCATTGAGAT 
TGTGCCAGTC 
TTACTTTCGC 
CATCACGAAC 
CGTTAAGAAC 
ACTCCAAGAA 
ATTGTTTTGA 
GGATTGTCAC 
AGATTGACAG 
AAATCCGGTG 
TGTTGAAGCT 
AT 
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GGATGGTCAA 
AACAATTGAT 
AACAGATTTG 
TGGTTACAAC 
GTGACAACAT 
GAAAAGGAAA 
TATTGACGCT 
TGCCATTGCA 
GGTAGAGTTG 
CCCAGCTGGT 
AATTGGCTGA 
GTTTCCGTTA 
CGATCCACCA 
ACCATCCAGG 
ACTGCCCACA 
AAGAACTGGT 
ATGCTGCTAT 
TTCACTGACT 



ACCAGAGAAC 
TGTTGCTGTC 
AAGAAATCAT 
CCAAAGACTG 
GATTGAACCA 
CCAAATCCGG 
ATTGAACCAC 
AGATGTTTAC 
AAACTGGTAT 
GTTACCACTG 
AGGTGTTCCA 
AAGAAATTAG 
AAGGGTTGTG 
TCAAATCTCT 
TTGCTTGTAA 
AAGAAATTGG 
CGTCAAGATG 
ACCCACCATT 



ACGCTTTGTT 
AACAAGATGG 
CAAGGAAACC 
TTCCATTCGT 
TCCACCAACT 
TAAAGTTACT 
CAACCAGACC 
AAGATCGGTG 
CATCAAAGCC 
AAGTCAAATC 
GGTGACAATG 
AAGAGGTAAC 
ACTCTTTCAA 
GCTGGTTACT 
ATTCGACACT 
AAGAAAATCC 
GTCCCAACCA 
AGGTAGATTC 



GGCTTACACT 
ACTCTGTCAA 
TCCAACTTCG 
TCCAATCTCT 
GTCCATGGTA 
GGTAAGACCT 
AACCGACAAA 
GTATTGGTAC 
GGTATGGTTG 
CGTTGAAATG 
TTGGTTTCAA 
GTTTGTGGTG 
TGCCCAAGTC 
CTCCAGTCTT 
TTGGTTGAAA 
AAAATTCGTC 
AACCAATGTG 
GCTGTCAGAG 



25(f 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1102 



25 



30 



35 



40 



45 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 412 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

. (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3108 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412 



CTTGGATAAA 
CTTTGTGGAA 
CCAGGTCACA 
TGATTGTGCT 
GTATTTCTAA 
TTGGGTGTCA 
ATGGGACAAA 
TCAAGAAGGT 
GGTTGGAATG 
CAAGGGTTGG 
TGTTAGAAGC 
CCATTGAGAT 
TGTGCCAGTC 
TCACTTTTGC 
CATCACGAAC 
CGTTAAGAAT 
ACTCCAAGAA 
ATTGTCTTGA 
GGATTGTCAC 
AGATTGACAG 
AAATCCGGTG 
TGTTGAAGCT 



TTGAAGGCTG 
ATTCGAAACT 
GAGATTTCAT 
ATTTTGATTA 
GGATGGTCAA 
AACAATTGAT 
AACAGATTTG 
TGGTTACAAC 
GTGACAACAT 
GAAAAGGAAA 
TATTGATGCT 
TGCCATTGCA 
GGTAGAGTTG 
CCCAGCTGGT 
AATTGGCTGA 
GTTTCTGTCA 
CGATCCACCA 
ACCATCCAGG 
ACTGCCCACA 
AAGAACTGGT 
ACGCTGCTAT 
TTCACTGACT 



AAAGAGAAAG 
CCAAAATACC 
CAAGAACATG 
TTGCTGGTGG 
ACCAGAGAAC 
TGTTGCTGTC 
AAGAAATCAT 
CCAAAGACTG 
GATTGAAGCT 
CCAAATCCGG 
ATTGAACCAC 
AGATGTTTAC 
AAACTGGTGT 
GTTACCACTG 
AGGTGTTCCA 
AAGAAATTAG 
AAGGGTTGTG 
TCAAATCTCT 
TTGCTTGTAA 
AAGAAATTGG 
YGTCAAGATG 
ACCCACCATT 
24 



AGGTATCACC 
ACGTTACCGT 
ATCACTGGTA 
TACTGGTGAA 
ACGCTTTGTT 
AACAAGATGG 
CAAGGAAACC 
TTCCATTCGT 
TCCACCAACT 
TAAGGTTACT 
CAACCAGACC 
AAGATCGGTG 
CATTAAAGCC 
AAGTCAAATC 
GGTGACAATG 
AAGAGGTAAC 
ACTCTTTCAA 
GCTGGTTACT 
ATTCGACACT 
AAGAAAATCC 
GTCCCAACCA 
AGGTAGATTC 
8 



ATTGATATCG 50 

CATTGATGCT 100 

CTTCTCAAGC 150 

TTCGAAGCCG 200 

GGCTTACACT 250 

ACTCTGTCAA 300 

TCTAACTTCG 350 

TCCAATCTCT 4 00 

GTCCATGGTA 4 50 

GGTAAGACCT 500 

AACCGACAAA 550 

GTATTGGTAC 600 

GGTATGGTTG 650 

CGTTGAAATG 700 

TTGGTTTCAA 750 

GTTTGTGGTG 800 

TGCCCAAGTC 850 

CTCCAGTCTT 900 

TTGGTTGAAA 950 

AAAATTCGTC 1000 

AACCAATGTG 1050 

GCTGTCAGAG 1100 
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5 2) INFORMATION FOR SEQ ID NO: 413 

{ i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413 



GGACAAGCTT AAGGCTGAGC GTGAGAGAGG TATCACCATT GACATTGCCT 50 

TGTGGAAGTT CGAGACTCCC AAGTACCACG TCACTGTCAT TGACGCCCCC 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACCT CGCAGGCTGA 150 

CTGTGCTATC TTGATCATTG CTTCCGGTGT CGGTGAGTTC GAGGCTGGTA 2 00 

25 TCTCCAAGGA CGGTCAGACC CGTGAGCACG CCTTGTTGGC CTACACCTTG 2 50 

GGTGTCAAGC AGTTGATCGT TGCCATCAAC AAGATGGACT CCGTCAAGTG 3 00 

GGACAAGAAC AGATTCGAGG AGATTGTCAA GGAGACCACC AACTTCGTCA 3 50 

AGAAGGTTGG TTACAACCCC AAGGCTGTCC CCTTCGTCCC CATCTCTGGC 4 00 

TGGAACGGTG ACAACATGAT TGAGGCCTCC ACCAACTGCC CCTGGTACAA 4 50 

3 0 GGGCTGGGAG AAGGAGACCA AGGCCGGTAA GTCTACCGGT AAGACCTTGT 500 

TGGAGGCCAT TGACGCCATT GAGCCCCCTA CCAGACCCAC CGACAAGCCC 550 

TTGAGATTGC CCTTGCAGGA TGTCTACAAG ATCGGTGGTA TTGGTACGGT 600 

GCCCGTCGGC CGTGTCGAGA CCGGTGTCAT CAAGCCCGGT ATGGTCGTCA 650 

CCTTCGCCCC CGCTGGTGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 700 

3 5 CACGAGCAGT TGTCCGAGGG TGTCCCCGGT GACAACGTTG GTTTCAACGT 750 

CAAGAACGTC TCTGTTAAGG AGATCAGACG TGGTAACGTC TGCGGTGACT 800 

CCAAGAACGA CCCCCCCATG GGTTGCTCTT CTTTCAACGC CCAGGTTATC 850 

GTGTTGAACC ACCCCGGTCA GATCTCTGCC GGTTACTCTC CCGTCTTGGA 900 

CTGCCACACC GCCCACATTG CTTGCAAGTT CGCCGAGTTG ATCGAGAAGA 950 

40 TTGACAGACG TACCGGTAAG TCCATGGAGG CTAACCCCAA GTTCGTCAAG 1000 

TCTGGTGACG CCGCCATCGT CAAGATGGAG CCCACTAAGC CCATGTGTGT 1050 

TGAGGCCTTC ACTGACTTCC CTCCTTTGGG TAGATTCGCC GTCAGAGA 1098 



45 



55 



2) INFORMATION FOR SEQ ID NO: 414 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1102 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3949 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414 
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CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

5 TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 3 00 

ATGGGACAAA AACAGATTCG AAGAAATCAT CAAGGAAACC TCTAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

10 GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 60 0 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

15 TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

ATTGTCTTGA ACCATCCAGG TCAAATCTCT GCTGGTTAYT CTCCAGTCTT 900 

20 GGATTGTCAC ACTGCCCACA TTGCTTGTAA ATTCGACACT TTGGTTGAAA 950 

AGATTGACAG AAGAACTGGT AAGAAATTGG AAGAAAATCC AAAATTCGTC 1000 

AAATCCGGTG ACGCTGCTAT YGTCAAGATG GTCCCAACCA AACCAATGTG 1050 

TGTTGAAGCT TTCACTGACT ACCCACCATT AGGTAGATTC GCTGTCAGAG 110 0 

AT 1102 

25 



2) INFORMATION FOR SEQ ID NO: 415 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 
40 (B) STRAIN: CBS 7987 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415 

CTTGGATAAA TTGAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

45 CTTTGTGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATCACTGGTA CTTCTCAAGC 150 

TGATTGTGCT ATTTTGATTA TTGCTGGTGG TACTGGTGAA TTCGAAGCCG 200 

GTATTTCTAA GGATGGTCAA ACCAGAGAAC ACGCTTTGTT GGCTTACACT 250 

TTGGGTGTCA AACAATTGAT TGTTGCTGTC AACAAGATGG ACTCTGTCAA 300 

50 ATGGGACAAA AACAGATTYG AAGAAATCAT CAAGGAAACC TCTAACTTCG 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATCTCT 400 

GGTTGGAATG GTGACAACAT GATTGAAGCT TCCACCAACT GTCCATGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAATCCGG TAAGGTTACT GGTAAGACCT 500 

TGTTAGAAGC TATTGATGCT ATTGAACCAC CAACCAGACC AACCGACAAA 550 

55 CCATTGAGAT TGCCATTGCA AGATGTTTAC AAGATCGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACTGGTGT CATTAAAGCC GGTATGGTTG 650 

TCACTTTTGC CCCAGCTGGT GTTACCACTG AAGTCAAATC CGTTGAAATG 700 

CATCACGAAC AATTGGTTGA AGGTGTTCCA GGTGACAATG TTGGTTTCAA 750 

CGTTAAGAAT GTTTCTGTCA AAGAAATTAG AAGAGGTAAC GTTTGTGGTG 800 

60 ACTCCAAGAA CGATCCACCA AAGGGTTGTG ACTCTTTCAA TGCCCAAGTC 850 

250 
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ATTGTCTTGA ACCATCCAGG 
GGATTGTCAC ACTGCCCACA 
AGATTGACAG AAGAACTGGT 
AAATCCGGTG ACGCTGCTAT 
5 TGTTGAAGCT TTCACTGACT 
AT 



TCAAATCTCT GCTGGTTAYT 
TTGCTTGTAA ATTCGACACT 
AAGAAATTGG AAGAAAATCC 
YGTCAAGATG GTCCCAACCA 
ACCCACCATT AGGTAGATTC 



PCT/CA00/01150 

CTCCAGTCTT 900 
TTGGTTGAAA 950 
AAAATTCGTC 10 00 

AACCAATGTG 1050 
GCTGTCAGAG 1100 
1102 



10 2) INFORMATION FOR SEQ ID NO: 416 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 628 94 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416 



AATTGAAGGC TGAAAGAGAA AGAGGTATCA CCATTGATAT CGCTTTATGG 50 

AAATTCGAAA CTCCAAAATA CCACGTTACC GTTATTGATG CTCCAGGTCA 100 

CAGAGATTTC ATCAAGAACA TGATTACTGG TACTTCTCAA GCTGATTGTG 150 

CTATTTTRAT TATTGCTGGT GGTGTCGGTG AATTCGAAGC CGGTATCTCT 2 00 

3 0 AAGGATGGTC AAACCAGAGA ACACGCTTTA TTGGCTTACA CCTTAGGTGT 2 50 

TAGACAATTG ATTGTTGCCG TCAACAAGAT GGACTCTGTT AAATGGGACA 3 00 

AGGCTAGATT CGAAGAAATC ATCAAGGAAA CCTCTAACTT CGTCAAGAAG 350 

GTTGGTTACA ACCCTAAGAC TGTTCCTTTC GTYCCAATTT CTGGATGGAA 4 00 

CGGTGACAAC ATGATTGAAG CCTCCACCAA CTGTCCATGG TACAAGGGTT 450 

3 5 GGGAAAAGGA AACCAAGGCT GGTAAATCTA CTGGTAAGAC TTTGTTAGAA 500 

GCCATTGATG CCATTGAACC ACCAACCAGA CCAACCGAAA AGCCATTGAG 550 

ATTACCATTA CAAGATGTCT ACAAGATCGG TGGTATTGGT ACTGTGCCAG 600 

TCGGTAGAGT TGAAACCGGT GTTATCAAGG GTGGTATGGT TGTTACCTTT 650 

GCCCCAGCCG GTGTCACTAC CGAAGTCAAA TCCGTTGAAA TGCACCACGA 70 0 

4 0 ACAATTAGCT GAAGGTGTTC CAGGTGACAA TGTTGGTTTC AACGTCAAGA 750 

ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTTTGTGG TGACTCCAAG 800 

AACGACCCAC CAAAGGGTGC TGAATCTTTC ACCGCTCAAG TTATTGTCTT 85 0 

GAACCACCCA GGTCARATCT CTGCTGGTTA CTCTCCAGTC TTAGATTGTC 900 

ACACCGCCCA CATTGCTTGT AAATTCGATG CTTTACTCGA AAAGATTGAC 950 

45 AGAAGATCCG GTAAGAAATT AGAAGACGAA CCAAAATTCG TCAAGTCCGG 10 00 

TGATGCTGCT ATCGTCAAGA TGGTCCCAAC CAAACCAATG TGTGTTGAAG 1050 

CTTTCACTGA ATACCCACCA TTAGGTAGAT TCGCTGTTAG AGAT 10 94 



50 



60 



2) INFORMATION FOR SEQ ID NO: 417 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1101 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

251 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida grlaJbrata 

(B) STRAIN: ATCC 66032 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417 

TTTGGACAAG TTGAAGGCTG AAAGAGAAAG AGGTATCACT ATCGATATCG 50 

CTTTGTGGAA GTTCGAAACT CCAAAGTACC ACGTYACCGT TATCGATGCY 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATTACTGGTA CTTCTCAAGC 150 

10 TGACTGTGCT ATCTTGATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCYG 200 

GTATCTCCAA GGATGGTCAA ACCAGAGAAC ACGCTCTATT GGCTTTCACC 2 50 

CTAGGTGTTA GACAATTGAT TGTYGCTGTC AACAAGATGG ACTCTGTCAA 300 

GTGGGATGAA TCCAGATTCG CTGAAATCGT TAAGGAAACC TCCAACTTCA 350 

TCAAGAAGGT CGGTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 4 00 

15 GGTTGGAACG GTGACAACAT GATTGAAGCC ACCACCAACG CTTCCTGGTA 450 

CAAGGGTTGG GAAAAGGAAA CCAAGGCTGG TGTCGTCAAG GGTAAGACCT 500 

TGTTGGAAGC CATTGACGCT ATCGAACCAC CAACCAGACC AACTGACAAG 550 

CCATTGAGAT TGCCATTGCA AGATGTCTAC AAGATCGGTG GTATCGGTAC 600 

GGTGCCAGTC GGTAGAGTCG AAACCGGTGT CATCAAGCCA GGTATGGTTG 650 

2 0 TTACCTTCGC CCCAGCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 70 0 

CACCACGAAC AATTGACTGA AGGTTTGCCA GGTGACAACG TTGGTTTCAA 750 

CGTTAAGAAC GTTTCCGTTA AGGAAATCAG AAGAGGTAAT GTCTGTGGTG 8 00 

ACTCCAAGAA CGACCCACCA AAGGCTGCTG CTTCTTTCAA CGCTACCGTC 850 

ATTGTCTTGA ACCACCCAGG TCAAATCTCT GCTGGTTACT CTCCAGTTTT 900 

25 GGACTGTCAC ACCGCCCACA TTGCTTGTAA GTTCGAAGAA TTGTTGGAAA 950 

AGAACGACAG AAGATCCGGT AAGAAGTTGG AAGACTCTCC AAAGTTCTTG 1000 

AAGTCCGGTG ACGCTGCTTT GGTTAAGTTC GTTCCATCCA AGCCAATGTG 1050 

TGTCGAAGCT TTCTCCGACT ACCCACCATT GGGTAGATTC GCTGTCAGAG 1100 

A 1101 

30 



2) INFORMATION FOR SEQ ID NO: 418 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1080 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Doubl e 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 
45 (B) STRAIN: ATCC 6260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418 

AGAGAAAGAG GTATCACCAT TGACATTGCT TTGTGGAAAT TCGAGACTCC 50 

50 AAAGTACCAC GTTACYGTCA TTGATGCCCC AGGTCACAGA GATTTCATCA 100 

AGAACATGAT CACTGGTACT TCTCAAGCTG ACTGTGCTAT TTTGATTATT 150 

GCTGGTGGTA CCGGTGAATT CGAAGCTGGT ATCTCTAAGG ATGGTCAAAC 2 00 

CAGAGAGCAC GCTTTGTTGG CTTACACCTT GGGTGTTAGA CAATTGATTG 250 

TTGCTGTCAA CAAGATGGAC TCCGTCAART GGGACAAGAA CAGATTYGAG 3 00 

55 GAAATCATCA AGGAAACCTC TAACTTCGTC AAGAAGGTTG GTTACAACCC 3 50 

TAAGACTGTG CCATTCGTTC CTATCTCTGG ATGGAAYGGT GACAACATGA 4 00 

TTGAGGCTTC TACCAACTGT CCTTGGTACA AGGGATGGGA GAAGGAGACC 450 

AAGGCTGGTA AGTCCACCGG TAAGACTTTG TTGGAGGCCA TTGACGCCAT 500 

TGAGCCACCT CAAAGAC C AA CCGACAAGCC ATTGAGATTG CCATTGCAAG 550 

60 ATGTYTACAA GATTGGTGGT ATTGGAACGG TGCCAGTCGG TAGAGTTGAA 600 

252 
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ACCGGTATCA TYAAGGCCGG TATGGTTGTT 
YACCACTGAA GTCAAGTCCG TGGAAATGCA 
GTGTTCCAGG TGACAATGTT GGTTTCAACG 
GAAATTAGAA GAGGTAACGT TTGTGGTGAC 
GGGTTGTGAC TCTTTCACCG CTCAAGTTAT 
AAATCTCTGC TGGTTACTCT CCAGTTTTGG 
GCTTGTAAAT TCGACACCTT GTTGGAGAAG 
GAAGATGGAG GACAACCCCA AGTTTGTCAA 
TCAAGATGGT GCCATCCAAG CCAATGTGTG 
CCACCATTGG GAAGATTCGC CGTCAGAGAC 



ACCTTTGCCC CAGCTGGTGT 650 

CCACGAACAA TTGGTTGAAG 700 

TTAAGAACGT TTCCGTTAAG 750 

TCCAAGAACG ACCCACCAAA 800 

TGTGTTGAAC CACCCTGGTC 850 

ACTGTCACAC CGCCCACATT 900 

ATTGACAGAA GAACCGGTAA 950 

GTCCGGTGAC GCTTCTATCG 100 0 

TTGAGGCTTT CACCGACTAC 1050 

1080 



2) INFORMATION FOR SEQ ID NO: 419 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, haemulonii 

(B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 419 



TCTGTCAAGT GGGACAAGGC CAGATACGAG 

TAACTTCGTC AAGAAGGTTG GTTACAACCC 

CAATCTCTGG TTGGAACGGT GACAACATGA 

GACTGGTACA AGGGTTGGGA GAAGGAGACC 

TAAGACCTTG TTGGAGGCCA TTGACGCCAT 

CCGACAAGCC ATTGAGATTG CCATTGCAGG 

ATCGGAACTG TGCCAGTCGG CAGAGTTGAG 

TATGGTTGTC ACCTTCGCCC CAGCTGGTGT 

TCGAGATGCA CCACGAGCAG TTGCCAGAGG 

GGTTTCAACG TCAAGAACGT TTCCGTTAAG 

CTGTGGTGAC TCCAAGCAGG ACCCACCAAA 

CTCAGGTTAT TGTGTTGAAC CACCCAGGTC 

CCAGTTTTGG ACTGTCACAC TGCCCACATT 

GGTTGAGAAG ATCGACAGAA GAACCGGTAA 

AGTTCATCAA GTCCGGTGAC GCTGCTATCG 
C 



GAAATCGTCA AGGAGACCTC 50 

TAAGACTGTT CCATTCGTCC 100 

TTGAGGCTTC TACCAACTGT 150 

AAGTCTGGTA AGTCCACCGG 200 

TGAGCCACCA ACCAGACCAA 2 50 

ATGTCTACAA GATTGGTGGT 3 00 

ACCGGTGTTA TCAAGGCCGG 3 50 

CACCACTGAA GTCAAGTCTG 4 00 

GTGTCCCAGG TGACAACGTT 450 

GAAATCAGAA GAGGTAACGT 500 

GGGCTGTGAC TCTTTCACCG 550 

AGATCTCTTC TGGTTACTCT 600 

GCTTGTAAGT TCGACACCTT 650 

GAAGTTGGAA GATGAGCCAA 700 

TCAAGATGGT CCCAACCAAG 750 

751 



2) INFORMATION FOR SEQ ID NO: 420 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida inconspicua 

253 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(B) STRAIN: ATCC 16783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420 

5 TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA ATTCGAAACT CCAAAGTATC ACGTTACCGT CATTGATGCT 100 

CCAGGTCACA GAGATTTCAT TAAGAACATG ATTACTGGTA CTTCTCAAGC 150 

AGATTGTGCT ATTTTGATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 2 00 

GTATTTCCAA GGATGGTCAA ACTAGAGAAC ACGCTTTATT AGCATTCACC 2 50 

10 TTAGGTGTTA AGCAATTGAT TGTTGCTATC AACAAGATGG ATTCTGTTAA 3 00 

GTGGGATGAA AAGAGATTTG AAGAAATTGT CAAGGAAACC CAAAACTTCA 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTC CATTCGT TCCAATTTCT 4 00 

GGTTGGAATG GTGACAACAT GATTGAACCA TCTTCTAACT GTCCATGGTA 450 

CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTCAAG GGTAAGACCT 500 

15 TATTAGAAGC TATTGATGCT ATTGAACCAC CTGTCAGACC AACTGATAAG 550 

CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACCGGTAT TATTAAGCCA GGTATGGTTG 650 

TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCATGAAC AATTAGAAGA AGGTGTCCCA GGTGACAATG TTGGTTTCAA 750 

2 0 CGTCAAGAAC GTCTCTGTTA AGGATATCAA GAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA TGACCCACCA CAAGGTTGTG CTTCCTTCAA TGCTCAAGTC 850 

ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCTGGTTACT CTCCAGTTTT 900 

AGATTGTCAC ACTGCCCACA TTGCATGTAA ATTCGATGAA TTAATTGAAA 950 

AGATTGACAG AAGAACTGGT AAGTCCGTTG AAGACCATCC AAAGTCTGTT 1000 

25 AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTCCAACCA AGCCAATGTG 1050 

TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC GCAGTCAGAG 1100 

AT 1102 



30 



40 



2) INFORMATION FOR SEQ ID NO: 421 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1099 bases 
35 (B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Doubl e 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421 

TGGACAAGTT AAAGGCTGAA AGAGAAAGAG GTATCACCAT CGATATCGCT 50 

TTGTGGAAGT TCGAAACTCC AAAGTACCAA GTTACCGTTA TCGATGCTCC 100 

AGGTCACAGA GATTTCATCA AGAACATGAT TACTGGTACT TCTCAAGCTG 150 

50 ACTGTGCTAT CTTGATTATT GCTGGTGGTG TCGGTGAATT CGAAGCCGGT 2 00 

ATCTCCAAGG ATGGTCAAAC CAGAGAACAC GCTTTGTTGG CTTACACCTT 2 50 

GGGTGTTAGA CAATTGATTG TTGCTATCAA CAAGATGGAC TCTGTTAAGT 3 00 

GGGATGAATC TCGTTACCAA GAAATTGTTA AGGAAACCTC CAACTTCATC 350 

AAGAAGGTCG GTTACAACCC AAAGAATGTT CCATTCGTCC CAATCTCTGG 400 

55 TTGGAACGGT GACAACATGA TTGAAGCCAC CACCAACGCT CCATGGTACA 450 

AGGGTTGGGA AAAGGAAACC AAGGCTGGTA CCGTCAAGGG TAAGACCTTG 5 00 

TTGGAAGCTA TTGACGCTAT CGAACCACCA ACCAGACCAA CTGACAAGCC 550 

ATTGAGATTG CCATTGCAAG ATGTCTACAA GATCGGTGGT ATTGGTACTG 600 

TGCCAGTCGG TAGAGTCGAA ACCGGTGTCA TCAAGCCAGG TATGGTTGTT 650 

60 ACCTTCGCCC CAGCCGGTGT CACTACCGAA GTTAAGTCCG TCGAAATGCA 700 

254 
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CCACGAACAA TTGGAAGAAG GTCTACCAGG 
TCAAGAACGT TTCCGTTAAG GAAATCAGAA 
TCCAAGAACG ATCCACCAAA GGCTGCTGCT 
CGTCTTGAAC CACCCAGGTC AAATCTCTGC 
ATTGTCACAC TGCTCACATT GCTTGTAAGT 
AACGACAGAA GATCCGGTAA GAAGTTGGAA 
GTCTGGTGAC GCTGCTTTGG TTAAGTTCGT 
TTGAAGCATT CTCTGACTAC CCACCATTGG 



PCT/CA00/01150 

TGACAACGTC GGTTTCAACG x 75 0 

GAGGTAACGT CTGTGGTGAC 800 

TCTTTCAACG CCACTGTTAT 850 

TGGTTACTCT CCAGTTTTGG 900 

TCGACGAATT GTTGGAAAAG 950 

GACTCTCCAA AGTTCTTGAA 1000 

TCCATCTAAG CCAATGTGTG 1050 

GTAGATTCGC TGTCAGAGA 1099 



2) INFORMATION FOR SEQ ID NO: 422 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 

(B) STRAIN: ATCC 34135 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422 

AAGTTAAAGG CAGAAAGAGA AAGAGGTATC ACTATTGATA TTGCTTTATG 50 

GAAGTTYGAA ACTCCAAART ACCACGTTAC CGTTATTGAT GCTCCAGGTC 100 

ACAGAGATTT CATCAAGAAC ATGATTACCG GTACTTCTCA AGCTGATTGT 150 

GCTATTTTGA TTATTGCTGG TGGTGTCGGT GAATTCGAAG CTGGTATCTC 200 

CAAGGATGGT CAAACTAGAG AACACGCTCT ATTGGCTTTC ACCTTAGGTG 250 

TTAGACAATT GATTGTTGCT ATCAACAAGA TGGATTCCGT TAARTGGGAT 3 00 

GAAAACAGAT TTGAAGAAAT TGTCAAGGAA ACCCAAAACT TCATCAAGAA 3 50 

GGTTGGTTAC AACCCAAAGA CTGTTCCATT CGTTCCAATY TCTGGTTGGA 4 00 

ATGGTGACAA CATGATTGAA GCATCCACCA ACTGTCCATG GTACAAGGGT 450 

TGGACTAAGG AAACCAAGGC AGGTGTTGTT AAGGGTAAGA CCTTATTAGA 500 

AGCAATCGAT GCTATTGAAC CACCTGTCAG ACCAACCGAA AAGCCATTAA 550 

GATTACCATT ACAAGATGTT TACAAGATTG GTGGTATTGG TACTGTGCCA 60 0 

GTCGGTAGAG TCGAAACCGG TGTCATTAAG CCAGGTATGG TTGTCACTTT 65 0 

TGCTCCAGCA GGTGTCACCA CCGAAGTCAA GTC CGTTGAA ATGCACCATG 70 0 

AACAATTAGA ACAAGGTGTT CCAGGTGATA ACGTTGGTTT CAACGTTAAG 75 0 

AACGTTTCTG TCAAGGATAT CAAGAGAGGT AACGTTTGTG GTGACTCCAA 8 00 

GAACGACCCA CCAATGGGTG CAGCTTCCTT CAATGCTCAA GTCATTGTCT 850 

TGAACCACCC TGGTCAAATT TCCGCTGGTT ACTCTCCAGT CTTGGATTGT 900 

CACACTGCCC ACATTGCATG TAAGTTCGAC GAATTAATCG AAAAGATTGA 950 

CAGAAGAACT GGTAAGTCTG TTGAAGACCA TCCAAAGTCY GTCAAGTCTG 1000 

GTGATGCAGC TATCGTCAAG ATGGTCCCAA CCAAGCCAAT GTGTGTTGAA 1050 

GCTTTCACTG AATAYCCACC ATTAGGTAGA TTCGCAGTCA GAGAT 1095 



2) INFORMATION FOR SEQ ID NO: 423 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1104 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lambica 
5 (B) STRAIN: ATCC 24750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423 

CTTGGACAAG CTTAAGGCTG AAAGAGAAAG AGGTATCACC ATTGATATCG 50 

10 CTTTATGGAA GTTCGAAACT CCAAAGTACC ACGTTACCGT CATTGACGCT 100 

CCAGGTCACA GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCTCAAGC 150 

AGATTGTGCT ATTTTRATYA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 2 00 

GTATCTCTAA GGATGGTCAA ACCAGAGAAC ACGCTCTTCT TGCATTCACT 250 

CTTGGTGTTA GACAATTGAT TGTTGCTATC AACAAGATGG ACTCTGTCAA 3 00 

15 GTGGGACGAA TCCAGATTCG ATGAAATTTG TAAGGAAACC GCWAACTTCA 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT CCCAATCTCT 4 00 

GGTTGGAACG GTGACAACAT GATTGAACCA TCTGCTAACT GTCCATGGTA 4 50 

CAAGGGATGG ACTAAGGAAA CCAAGGCTTC CGGTGTCGTC AAGGGTAAGA 5 00 

CCCTTCTTGA AGCAATTGAT GCTATTGAGC CACCTGTCAG ACCAACTGAC 550 

2 0 AAGGCTTTGA GATTGCCATT RCAAGATGTC TACAAGATTG GTGGTATTGG 600 

TACTGTGCCA GTCGGTAGAG TTGAAACCGG TATCATCAAG CCAGGTATGA 650 

TTGTCGTTTT CGCTCCAACC GGTGTTACTA CTGAAGTTAA GTCCGTTGAA 700 

ATGCACCATG AACAATTAGA AGAAGGTGTC CCAGGTGACA ATGTTGGTTT 750 

CAACGTCAAG AACGTCTCTG TTAAGGATAT TAAGAGAGGT AACGTCTGTG 8 00 

2 5 GTGACTCCAA GAACGACCCA CCAATGGGTT GTGCTTCCTT CAATGCTCAA 850 

GTCATTGTTC TTAACCACCC AGGTCAAATT TCTGCTGGTT ACTCACCAGT 900 

TCTTGACTGT CACACTGCCC ACATTGCATG TAAGTTCGAY GAATTACTCG 950 

AAAAGATTGA CAGAAGAACC GGTAAGGCTA CTGAAGACCA TCCAAAGTCT 10 00 

GTCAAGTCTG GTGATGCAGC TATCGTCAAG ATGGTTCCAA CCAAGCCAAT 1050 

3 0 GTGTGTYGAA GCTTTCACTG ACTACCCACC ATTAGGTAGA TTCGCTGTYA 1100 

GAGA 1104 



35 2) INFORMATION FOR SEQ ID NO: 424 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 98 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lusitaniae 

(B) STRAIN: ATCC 66035 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424 



GGACAAGTTG AAGGCTGAGA GAGAAAGAGG TATCACCATC GATATCGCTT 50 

TGTGGAAGTT CGAGACTCCA AAGTACCACG TTACCGTCAT TGACGCTCCA 10 0 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAAGCTGA 150 

CTGTGCTATC TTGATTATCG CTGGTGGTGT CGGTGAGTTC GAAGCCGGTA 2 00 

55 TCTCTAAGGA CGGTCAAACC AGAGAGCACG CTTTGTTGGC TTACACCTTG 250 

GGTGTCAAGC AGTTGATTGT TGCTGTCAAC AAGATGGACT CCGTCAAGTG 3 00 

GGACCAATCT AGATTCGAGG AAATCATCAA GGAAACCTCT AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCT AAGACTGTTC CATTCGTCCC AATCTCTGGT 4 00 

TGGAACGGTG ACAACATGAT TGAGCCATCY ACCAACTGCC CATGGTACAA 450 

60 GGGTTGGGAG AAGGAGACCA AGTCYGGTAA GTCCACCGGT AAGACCTTGT 500 
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TGGAGGCCAT TGACGCCATT GAGCCACCTT 
TTGAGATTGC CATTGCAAGA TGTCTACAAG 
GCCAGTCGGT AGAGTTGAGA CCGGTGTCAT 
CCTTTGCTCC AGCTGGTGTC ACCACTGAAG 
5 CACGAACAAT TGGCTGAGGG TGTCCCAGGT 
CAAGAACGTT TCCGTCAAGG AAATCAGAAG 
CCAAGAACGA CCCACCAAAG GCTGCTGCTT 
GTCTTGAACC ACCCAGGTCA AATCTCCTCY 
CTGTCACACT GCYCACATTG CTTGTAAGTT 
10 TCGACAGAAG AACYGGTAAG AAGTTGGAAG 
TCYGGTGACG CTGCTATCGT CAAGATGGTC 
YGAAGCTTTC ACCGACTACC CACCATTGGG 



PCT/CA00/01150 

CGAGACCAAC CGACAAGCTCA' 55 9 

ATYGGTGGTA TTGGTACTGT 600 

CAAGGCCGGT ATGGTTGTCA 650 

TCAAGTCCGT GGAAATGCAC 700 

GACAACGTTG GTTTCAACGT 750 

AGGTAACGTC TGTGGTGACT 800 

CYTTCACTGC TCAAGTYATY 850 

GGTTACTCTC CAGTYTTGGA 900 

CGACACCTTG ATTGAGAAGA 950 

AAGAGCCAAA GTTCATCAAG 1000 

CCAACCAAGC CAATGTGYGT 1050 

TAGATTCGCT GTCAGAGA 1098 



15 



2) INFORMATION FOR SEQ ID NO: 425 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1101 bases 
2 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegensis 

(B) STRAIN: ATCC 22977 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425 

TCTTGACAAG TTAAAGGCTG AAAGAGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA ATTCGAAACT CCAAAATACC ACGTTACCGT TATTGATGCT 100 

CCAGGTCACA GAGATTTCAT TAAGAACATG ATTACTGGTA CTTCCCAAGC 150 

35 TGATTGTGCT ATCTTAATTA TTGCTGGTGG TGTCGGTGAA TTCGAAGCTG 2 00 

GTATCTCCAA GGATGGTCAA ACCAGAGAAC ACGCTTTATT AGCATTCACC 2 50 

TTAGGTGTTA AGCAATTAAT TGTTGCTATC AACAAGATGG ACTCTGTTAA 3 00 

GTGGGATGAA AAGAGATTTG AAGAAATTGT CAAGGAAACC CAAAACTTCA 3 50 

TCAAGAAGGT TGGTTACAAC CCAAAGACTG TTCCATTCGT TCCAATTTCT 4 00 

4 0 GGTTGGAATG GTGACAACAT GATTGAACCA TCTACTAACT GTCCATGGTA 450 

CAAGGGTTGG ACTAAGGAAA CCAAGGCAGG TGTTGTTAAG GGTAAGACCT 500 

TATTAGAAGC TATTGATGCT ATTGAACCAC CTGTCAGACC AACTGACAAG 550 

CCATTAAGAT TACCATTACA AGATGTTTAC AAGATTGGTG GTATTGGTAC 600 

TGTGCCAGTC GGTAGAGTTG AAACCGGTGT TATTAAGCCA GGTATGGTTG 650 

4 5 TTGTTTTCGC ACCATCTGGT GTTACCACTG AAGTCAAGTC CGTTGAAATG 700 

CACCATGAAC AATTAGAACA AGGTGTCCCA GGTGACAATG TTGGTTTCAA 750 

CGTCAAGAAC GTCTCTGTTA AGGATATTAA GAGAGGTAAC GTTTGTGGTG 800 

ACTCCAAGAA CGACCCACCA CAAGGTTGTG CTTCCTTCAA TGCTCAAGTC 850 

ATTGTCTTGA ACCACCCTGG TCAAATTTCT GCAGGTTACT CTCCAGTTTT 900 

50 AGATTGTCAC ACTGCCCACA TTGCATGTAA GTTCGATGAA TTAATCGAAA 950 

AGATTGACAG AAGAACTGGT AAGTCCGTTG AAGACCATCC AAAGTCTGTT 1000 

AAGTCTGGTG ATGCAGCTAT CGTTAAGATG GTTCCAACCA AGCCAATGTG 1050 

TGTTGAAGCT TTCACTGAAT ACCCACCATT AGGTAGATTC GCAGTCAGAG 1100 

A 1101 

55 

2) INFORMATION FOR SEQ ID NO: 426 
60 (i) SEQUENCE CHARACTERISTICS: 

257 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(A) LENGTH: 1095 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 
10 (B) STRAIN: ATCC 90018 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426 

CAAATTGAAG GCTGAAAGAG AAAGAGGTAT CACCATTGAT ATCGCTTTGT 50 

15 GGAAATTCGA AACTCCAAAA TACCATGTTA CTGTTATTGA TGCTCCAGGT 100 

CACAGAGATT TCATCAAGAA TATGATTACT GGTACTTCTC AAGCTGATTG 150 

TGCTATTTTG ATTATTGCTG GTGGTACTGG TGAATTCGAA GCTGGTATCT 2 00 

CTAAGGATGG TCAAACCAGA GAACACGCTT TGTTGGCTTA CACCTTGGGT 2 50 

GTTAAGCAAT TGATTGTTGC CATCAACAAG ATGGAC T C AG TCAAATGGGA 3 00 

20 CAAGAACAGA TACGAAGAAA TTGTCAAGGA AACTTCCAAC TTCGTCAAGA 3 50 

AGGTTGGTTA CAACCCTAAA GCTGTCCCAT TCGTCCCAAT CTCTGGTTGG 4 00 

AACGGTGACA ATATGATTGA ACCATCAACC AACTGTCCAT GGTACAAGGG 4 50 

TTGGGAAAAG GAAACTAAAG CTGGTAAGGT TACCGGTAAG ACCTTGTTGG 500 

AAGCTATCGA TGCTATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG 550 

2 5 AGATTGCCAT TGCAAGATGT CTACAAGATT GGTGGTATTG GAACTGTGCC 60 0 

AGTTGGTAGA GTTGAAACCG GTATCATCAA GGCTGGTATG GTTGTTACTT 650 

TTGCCCCAGC TGGTGTTACC ACTGAAGTCA AGTCCGTTGA AATGCACCAC 700 

GAACAATTGA CTGAAGGTGT CCCAGGTGAC AATGTTGGTT TCAACGTCAA 750 

GAACGTTTCA GTTAAGGAAA TCAGAAGAGG TAACGTTTGT GGTGACTCCA 800 

30 AGAACGATCC ACCAAAGGGA TGTGAYTCCT TCAATGCTCA AGTTATTGTC 850 

TTGAACCACC CAGGTCAAAT CTCTGCTGGT TACTCACCAG TCTTGGATTG 900 

TCACACTGCC CACATTGCTT GTAAATTCGA CACTTTGATT GAAAAGATTG 950 

ACAGAAGAAC CGGTAAGAAA TTGGAAGATG AACCAAAATT CATCAAGTCC 1000 

GGTGATGCTG CYATCGTCAA GATGGTCCCA ACCAAGCCAA TGTGTGTTGA 1050 

35 AGCTTTCACT GACTACCCAC CATTGGGAAG ATTCGCTGTT AGAGA 10 95 



2) INFORMATION FOR SEQ ID NO: 427 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427 

55 CTCCGTCAAG TGGTCTCAGT CTCGTTTCGA GGAGATCGTC AAGGAGGTTT 50 

CCAACTTCAT CAAGAAGGTT GGTTACAAGC CCGATGAGGT TCCTTTCGTC 100 

CCCATCTCTG GCTGGAACGG CGACAACATG CTTGAGCCCT CCACCAACTG 150 

CCCCTGGTAC AAGGGATGGA CCAAGAAGAC CAAGAAGGGT GAGGTCAAGG 200 

GTAAGACTCT TCTCGAGGCC ATTGACGCCA TCGAGCCCCC CTCCCGTCCT 2 50 

60 ACCGACAAGC CCCTCCGCTT GCCTCTTCAG GATGTCTACA AGATCGGCGG 3 00 

258 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
TATCGGTACG GTACCTGTCG GCCGTGTCGA 
GCATGGTCGT CACTTTCGCC CCCGCTGGTG 
GTCGAGATGC ACCACGAGCA GATCCCCGAG 
CGGTTTCAAC GTCAAGAACG TTACCGTCAA 
5 TCTGCGGTGA CTCCAAGAAC GACCCCCCCA 
GCCCAGGTCA TCGTTTTCAA CCACCCCGGT 
CCCCGTTTTG GACTGCCACA CCGCCCACAT 
TCCAGTCCAA GATGGACCGT CGTACTGGTA 
AAGTTCATCA AGGCTGGTGA CTCCGCTATC 
10 GC 



PCT/CA00/01150 

GACCGGTATC ATCAAGCCCG 3 50 

TCACCACTGA AGTGAAGTCC 4 00 

GGTCTCCCCG GTGACAACGT 4 50 

GGATATCCGC CGTGGTAACG 500 

AGGGCTGCTC TTCCTTCACT 550 

CAGATCTCCA ACGGTTACTC 600 

TGCCTGCCGC TTCGACGAGA 650 

AGACCCTTGA GGAGAACCCC 7 00 

GTCAAGATGG TTCCCTCGAA 750 

752 



2) INFORMATION FOR SEQ ID NO: 428 



15 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Candida, sphaerica 

(B) STRAIN: ATCC 2504 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428 

3 0 AGTTAAAGGC TGAAAGAGAA AGAGGTATCA CCATCGATAT CGCTTTGTGG 50 

AAGTTCGAAA CTCCAAAGTA CCAAGTTACC GTTATCGATG CTCCAGGTCA 100 

CAGAGATTTC ATCAAGAACA TGATTACTGG TACTTCTCAA GCTGACTGTG 150 

CTATCTTGAT TATTGCTGGT GGTGTCGGTG AATTCGAAGC CGGTATCTCC 2 00 

AAGGATGGTC AAACCAGAGA ACACGCTTTG TTGGCTTTCA CCTTGGGTGT 250 

3 5 TAGACAATTG ATTGTTGCTG TTAACAAGAT GGATTCCGTT AAGTGGGATG 300 

AATCTCGTTT CCAAGAAATT GTCAAGGAAA CCTCTAACTT CATCAAGAAG 350 

GTCGGTTACA ACCCAAAGAC TGTTCCATTC GTCCCAATCT CTGGTTGGAA 4 00 

CGGTGACAAC ATGATTGAAG CCACCACCAA TGCTTCATGG TACAAGGGTT 450 

GGGAAAAGGA AACCAAGTCC GGTGTCGTCA AGGGTAAGAC CTTGTTGGAA 500 

4 0 GCTATTGACG CTATCGAACC ACCATCCAGA CCAACTGACA AGCCATTGAG 550 

ATTGCCATTG CAAGATGTCT ACAAGATTGG TGGTATCGGA ACTGTGCCAG 600 

TCGGTAGAGT CGAAACCGGT GTTATCAAGC CAGGTATGAT TGTTACCTTT 650 

GCCCCAGCCG GTGTTACTAC TGAAGTTAAG TCCGTCGAAA TGCACCACGA 70 0 

ACAATTGGAA GAAGGTCTAC CAGGTGACAA CGTCGGTTTC AACGTCAAGA 750 

45 ACGTTTCCGT TAAGGAAATC AGAAGAGGTA ACGTCTGTGG TGACTCCAAG 8 00 

AACGATCCAC CAAAGGCTGC TGCTTCTTTC AACGCCACTG TTATCGTCTT 85 0 

GAACCATCCA GGTCAAATCT CTGCTGGTTA CTCTCCAGTT TTGGATTGTC 900 

ACACTGCTCA CATTGCTTGT AAGTTCGACG AATTGTTGGA AAAGAACGAT 950 

AGAAGATCCG GTAAGAAGTT GGAAGACTCT CCAAAGTTCT TGAAGTCCGG 1000 

50 TGATGCTGCT TTGGTTAAGT TCGTTCCATC TAAGCCAATG TGTGTTGAAG 1050 

CCTTCTCTGA CTACCCACCT CTAGGTAGAT TCGCTGTCAG AGA 1093 



55 2) INFORMATION FOR SEQ ID NO: 429 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

259 



SUBSTITUTE SHEET (RULE 2b) 



WO 01/23604 
(D) 



PCT/CA00/01150 



TOPOLOGY: Linear 



10 



15 



20 



25 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 13803 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429 



AAATTGAAGG 
GAAATTCGAA 
ACAGAGATTT 
GCTATTTTGA 
TAAAGATGGT 
TCAAACAATT 
AAAAACAGAT 
GGTTGGTTAC 
ATGGTGACAA 
TGGGAAAAAG 
AGCCATTGAT 
GATTGCCATT 
GTCGGTAGAG 
CGCCCCAGCT 
AACAATTGGC 
AACGTTTCTG 
GAACGATCCA 
TGAACCACCC 
CACACTGCTC 
CAGAAGAACT 
GTGATGCTGC 
GCTTTCACTG 



CTGAAAGAGA 
ACTCCAAAAT 
CATCAAGAAC 
TTATTGCTGG 
CAAACCAGAG 
GATTGTTGCT 
TTGAAGAAAT 
AACCCTAAGG 
CATGATTGAA 
AAACCAAGGC 
GCTATTGAAC 
GCAAGATGTT 
TTGAAACTGG 
GGTGTTACCA 
TGAAGGTGTC 
TTAAAGAAAT 
CCAAAGGGTT 
AGGTCAAATC 
ATATTGCTTG 
GGTAAGAAAT 
TATTGTCAAG 
ACTACCCACC 



AAGAGGTATC 
ACCACGTTAC 
ATGATTACTG 
TGGTACTGGT 
AACACGCTTT 
GTCAACAAGA 
TATCAAGGAA 
CTGTTCCATT 
GCTTCTACCA 
TGGTAAGGTT 
CACCTTCAAG 
TACAAGATTG 
TGTCATCAAA 
CTGAAGTCAA 
CCAGGTGACA 
TAGAAGAGGT 
GTGACTCTTT 
TCTGCTGGTT 
TAAATTCGAC 
TGGAAGAAAA 
ATGGTTCCAA 
ATTAGGTAGA 



ACCATTGATA 
CGTTATTGAT 
GTACTTCCCA 
GAATTCGAAG 
GTTGGCTTAC 
TGGACTCTGT 
ACTTCTAACT 
CGTTCCAATC 
ACTGTCCATG 
ACCGGTAAGA 
ACCAACTGAC 
GTGGTATTGG 
GCCGGTATGG 
ATCCGTCGAA 
ATGTTGGTTT 
AACGTTTGTG 
CAACGCTCAA 
ACTCTCCAGT 
ACCTTGGTTG 
TCCAAAATTC 
CCAAACCAAT 
TTCGCTGTCA 



TCGCTTTGTG 50 

GCTCCAGGTC 100 

AGCTGATTGT 150 

CTGGTATTTC 2 00 

ACCTTGGGTG 2 50 

TAAATGGGAC 3 00 

TCGTCAAGAA 3 50 

TCTGGTTGGA 4 00 

GTACAAGGGT 450 

CTTTGTTGGA 5 00 

AAGCCATTGA 550 

TACTGTGCCA 600 

TTGTTACTTT 650 

ATGCACCACG 700 

CAACGTTAAG 750 

GTGACTCCAA 800 

GTTATTGTCT 850 

CTTGGATTGT 900 

AAAAGATTGA 950 

GTCAAATCCG 1000 

GTGTGTTGAA 1050 

GAGA 10 94 



35 



45 



2) INFORMATION FOR SEQ ID NO: 43 0 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1095 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 

(B) STRAIN: Csp 388 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430 

CAAGCTTAAA GCTGAGAGAG AGAGAGGTAT CACTATCGAC ATTGCTCTCT 50 

GGAAGTTCGA GACTCCAAAG TACCACGTTA CTGTCATTGA TGCCCCAGGT 10 0 

CACAGAGATT TCATCAAGAA CATGATTACT GGTACCTCCC AGGCTGACTG 15 0 

55 TGCTATTCTT ATCATTGCCG GTGGTGTTGG TGAGTTCGAG GCTGGTATCT 2 00 

CTAAGGATGG TCAGACCAGA GAGCACGCTT TGCTCGCTTT CACCCTTGGT 250 

GTTAGACAGA TGATTGTTGC TATCAACAAG ATGGACTCTG TCAAGTGGGA 3 00 

CGAGAAGAGA TTCGAGGAGA TCGTTAAGGA GACCTCTAAC TTCATCAAGA 350 

AGGTTGGTTA CAACCCAAAG ACTGTTCCAT TTGTCCCAAT TTCYGGTTGG 400 

60 AACGGTGACA ACATGATTGA GGCCTCTACC AACTGTCCAT GGTACAAGGG 450 

260 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TTGGGAGAAG GAGACCAAGG CTGGTGTTGT 
ATGCCATTGA CGCCATTGAG CCACCAACAA 
AGATTGCCAC TCCAGGATGT CTACAAGATT 
AGTCGGTAGA GTCGAGACCG GTGTCATCAA 
5 TTGCCCCATC CGGTGTCACC ACTGAGGTTA 
GAGCAGCTTG CTGAGGGTAT CCCAGGTGAC 
GAACGTCTCT GTTAAGGAGA TCAGAAGAGG 
AGAACGACCC ACCACAGGGT GCTGAGTCCT 
TTGAACCACC CAGGTCAGAT CTCTGCTGGT 
10 TCACACCGCC CACATTGCTT GTAAGTTCTC 
ACAGAAGATC CGGTAAGTCC CTTGAGGCCT 
GGTGATGCCG CTATCGTCAA GATGGTTCCA 
GGCCTTCACT GACTACCCAC CACTCGGTAG 

15 



PCT/CA00/01150 

CAAGGGTAAG ACCTTGCTCG 1 &6 

GACCAACTGA CAAGCCATTG 550 

GGTGGTATCG GAACTGTTCC 600 

GCCAGGTATG GTTGTTACCT 650 

AGTCCGTCGA GATGCACCAC 700 

AACGTTGGTT TCAACGTTAA 750 

TAACGTTGCC GGTGACTCCA 800 

TCAACGCTCA GGTCATTGTC 850 

TACTCTCCAG TTTTGGACTG 900 

TGAGCTTTTG GAGAAGATTG 950 

CTCCAAAGTT CGTCAAGTCT 1000 

TCCAAGCCAT TGTGTGTTGA 1050 

ATTCGCTGTC AGAGA 1095 



2) INFORMATION FOR SEQ ID NO: 431 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1085 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 431 



GCTGAAAGAG AAAGAGGTAT CACCATCGAT ATCGCTTTGT GGAAATTCGA 50 

AACTCCAAAR TACCACGTTA CCGTCATTGA YGCTCCAGGT CACAGAGATT 100 

35 TCATCAAGAA CATGATYACT GGTACTTCTC AAGCTGATTG TGCTATYTTG 150 

ATTATCGCTG GTGGTACTGG TGAATTCGAA GCTGGTATYT CTAAGGATGG 2 00 

TCAAACCAGA GAACACGCTT TGTTGGCCTA CACCTTGGGT GTCAAGCAAT 2 50 

TGATTGTTGC TGTCAACAAG ATGGACTCTG TCAAATGGGA CAAGAACAGA 3 00 

TTCGAAGAAA TCATCAAGGA AACCTCCAAC TTCGTCAAGA AGGTTGGTTA 3 50 

40 CAACCCAAAG ACTGTTCCAT TCGTCCCAAT CTCTGGTTGG AACGGTGACA 4 00 

ACATGATTGA AGCCTCCACC AACTGCCCAT GGTACAAGGG TTGGGAAAAG 450 

GAAACCAAGG CTGGTAAGGT TACCGGTAAG ACTTTGTTGG AAGCCATTGA 5 00 

CGCTATCGAA CCACCAACCA GACCAACTGA CAAGCCATTG AGATTGCCAT 550 

TGCAAGATGT CTACAAGATT GGTGGTATCG GAACTGTGCC AGTCGGTAGA 600 

45 GTTGAAACTG GTGTCATCAA GGCCGGTATG GTTGTCACTT TYGCCCCAGC 650 

TGGTGTTACC ACTGAAGTCA AGTCCGTTGA AATGCACCAC GAACAATTGG 700 

CTGAAGGTGT CCCAGGTGAC AACGTTGGTT TCAACGTCAA GAACGTTTCC 750 

GTCAAGGAAA TCAGAAGAGG TAACGTCTGT GGTGACTCCA AGAACGACCC 8 00 

ACCAAAGGGT TGTGAS TCTT TCAACGCTCA AGTCATTGTC TTGAACCACC 8 50 

50 CAGGTCAAAT CTCTGCTGGT TACTCTCCAG TCTTGGATTG TCACACTGCC 900 

CACATTGCTT GTAAGTTTGA CACCTTGGTT GAAAAGATTG ACAGAAGAAC 950 

CGGTAAGAAG TTGGAAGAAA ACCCAAAGTT TGTCAAGTCC GGTGACGCTG 1000 

CTATCGTCAA GATGGTCCCA ACCAAGCCAA TGTGTGTTGA AGCYTTCACT 1050 

GACTACCCAC CATTGGGTAG ATTCGCTGTC AGAGA 1085 

55 



2) INFORMATION FOR SEQ ID NO: 432 
60 (i) SEQUENCE CHARACTERISTICS: 

261 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

(A) LENGTH: 1072 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

( i i ) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 
10 (B) STRAIN: ATCC 7351 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432 

AGGTATTACC ATTGACATTG CCTTGTGGAA GTTCGAGACC CCCAAGTACC 50 

15 AGGTCACCGT CATTGACGCT CCTGGCCACA GAGATTTCAT TAAGAACATG 100 

ATCACTGGTA CCTCCCAGGC TGACTGTGCC ATCTTGATCA TTGCTGGTGG 150 

TGTTGGTGAG TTCGAGGCTG GTATCTCCAA GGATGGCCAG ACCAGAGAGC 2 00 

ACGCCTTGCT TGCCTACACC TTGGGTGTCA AGCAATTGAT TGTTGCTGTC 2 50 

AACAAGATGG ACTCCGTCAA GTGGGACAAG AACAGATTCG AGGAGATTGT 3 00 

2 0 CAAGGAGACC TCCAACTTCG TCAAGAAGGT TGGCTACAAC CCCAAGACTG 3 50 

TCCCCTTCGT TCCCATCTCC GGTTGGAACG GTGACAACAT GATTGAGGCC 4 00 

TCCACCAACT GCCCTTGGTA CAAGGGTTGG GAGAAGGAGA CCAAGGCCGG 4 50 

TAAGGTCACT GGTAAGACCT TGTTGGAGGC TATTGACGCC ATTGAGCCCC 500 

CCACCAGACC CACCGACAAG CCCTTGAGAT TGCCCTTGCA GGATGTCTAC 550 

2 5 AAGATTGGTG GTATTGGAAC GGTGCCCGTT GGCAGAGTTG AGACCGGCAT 600 

CATCAAGGCC GGTATGGTTG TCACCTTTGC CCCCGCTGGT GTCACTACTG 650 

AAGTGAAGTC TGTCGAGATG CACCACGAGC AATTGGCTGA GGGTGTCCCA 700 

GGTGACAATG TTGGTTTCAA CGTGAAGAAC GTTTCCGTTA AGGAGATCAG 750 

AAGAGGTAAC GTTTGCGGTG ACTCCAAGAA CGACCCCCCC AAGGCTGCTG 8 00 

3 0 CTTCTTTCAA CGCCCAGGTT ATCGTCTTAA ACCACCCCGG TCAAATCTCT 8 50 

GCTGGTTACT CTCCGGTTTT GGATTGCCAC ACTGCCCACA TTGCTTGCAG 90 0 

ATTCGACCAG TTGATTGAGA AGATCGACAG AAGAACCGGT AAGAAGATGG 950 

AGGACGACCC TAAGTTCATC AAGTCCGGTG ACGCTGCCAT CGTCAAGATG 1000 

GTTCCTTCCA AGCCCATGTG TGTTGAGGCC TTCACTGACT ACCCTCCCTT 1050 

3 5 GGGTCGTTTC GCTGTCAGAG AC 1072 



2) INFORMATION FOR SEQ ID NO: 433 



40 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

( i i ) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Coccidioides immitis 

(B) STRAIN: Silveira 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433 

55 AGCACCAACT GGTCCGAGCC TCGTTTCAAC GAAATCGTCA AGGAAGTCTC 50 

CAACTTCATC AAGAAGGTCG GATACAACCC CAAGGCTGTT CCATTCGTCC 100 

CCATCTCTGG TTTCGAAGGT GACAACATGA TTCAACCCTC CACCAACGCT 150 

CCTTGGTACA AGGGCTGGAA CAAGGAGACC GCCTCTGGCA AGCACACTGG 2 00 

CAAGACCCTC CTCGACGCCA TTGATGCCAT CGACCCCCCA ACCCGCCCCA 250 

60 CCGAGAAGCC CCTCCGTCTC CCACTTCAGG ATGTGTACAA GATCTCTGGT 3 00 

262 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

ATCGGAACAG TCCCAGTCGG 
TATGGTTGTG ACCTTCGCTC 
TCGAAATGCA CCACCAGCAG 
GGCTTCAACG TCAAGAACGT 
5 CGCTGGTGAC TCCAAGAACG 
CCCAGGTCAT CGTCCTCAAC 
CCAGTCCTTG ACTGCCACAC 
CCTCGAGAAG ATCGACCGCC 
AGTTCATCAA GTCTGGTGAT 
10 C 



CCGTGTCGAA ACCGGTGTTA 
CTTCCAACGT CACCACTGAA 
CTCACCCAGG GTAACCCTGG 
CTCTGTCAAG GAAGTCCGCC 
ACCCACCAAA GGGCTGCGAC 
CACCCTGGTC AAGTCGGTGC 
TGCCCACATT GCTTGCAAGT 
GTACCGGTAA ATCCGTTGAG 
GCCGCTATCG TCAAGATGGT 



PCT/CA00/01150 

TCAAGCCTGG * J50 

GTCAAGTCCG 4 00 

TGACAACGTT 450 

GCGGTAACGT 500 

TCCTTCAACG 550 

TGGTTATGCC 600 

TCTCCGAGCT 650 

AACAACCCCA 700 

TCCATCCAAG 750 

751 



2) INFORMATION FOR SEQ ID NO: 434 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Cryptococcus albidus 

(B) STRAIN: ATCC 66030 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434 

3 0 AAGCTCAAGG CCGAGCGAGA GCGAGGTATC ACCATCGACA TCGCCTTGTG 50 

GAAGTTCGAG ACCCCCAAGT ACAATGTCAC CGTCATTGAC GCCCCCGGTC 100 

ACCGAGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA GGCCGACTGT 150 

GCCATCCTCA TCATCGCCTC CGGTATCGGA GAGTTCGAGG CTGGTATCTC 2 00 

CAAGGACGGT CAGACCCGAG AGCACGCCCT TTTGGCCTTC ACCCTCGGTG 2 50 

35 TCCGACAGCT CATCATTGCC ATCAACAAGA TGGACACCTG CAAGGTTAGT 3 00 

TCGCAGGTCC TGGTCTCTGT ACGAATCTTG CTGACCCCTT TTACAGTGGT 350 

CCGAAGACCG ATACAACGAA ATCGTCAAGG AGGCTTCCGG TTTCATCAAG 4 00 

AAGGTCGGAT ACAACCCCAA GACCGTTCCC TTCGTCCCCA TCTCCGGATG 450 

GCACGGAGAC AACATGTTGG AGGAGTCCAC CAACATGCCC TGGTACAAGG 500 

4 0 GATGGCACAA GGAGTCCAAG GCCGGTGTTG TCAAGGGAAA GACCTTGCTC 550 

GAGGCCATCG ACGCCATCGA GCCCCCTACC CGACCTTCCG ACAAGCCCTT 600 

GCGATTGCCC CTC CAGGATG TCTACAAGAT CGGTGGTATC GGTACGGTGC 650 

CCGTCGGTCG AGT CGAGACC GGTGTCATCA AGGCTGGTAT GGTCGTCACC 700 

TTCGCCCCTG CCAACGTCAC CACCGAAGTC AAGTCCGTCG AAATGCACCA 7 50 

45 CGAACAGCTC GCTGAGGGTG TTCCCGGTGA CAACGTCGGT TTCAACGTCA 800 

AGAACGTTTC CGTCAAGGAC ATCCGACGAG GAAACGTCTG CTCCGACTCG 850 

AAGAACGACC CCGCTATGGA GTCTGCTTCC TTCAACGCTC AGGTCATTGT 900 

CTTGAACCAC CCGGGTCAGA TCGGTGCCGG CTACTCCCCC GTTTTGGACT 950 

GCCACACCGC TCACATTGCC TGCAAGTTCG CTGAGCTCGT TGAGAAGATC 1000 

50 GACCGACGAA CCGGTAAGGT CATGGAGGCC GCCCCCAAGT TCGTCAAGTC 1050 

CGGTGACGCC GCCATCGTCA AGTTGATCCC GTCCAAGCCC ATGTGTGTCG 1100 

AGTCCTACTC CGAGTACCCC CCCTTGGGTC GATTCGCCGT CCGAGA 1146 



55 



2) INFORMATION FOR SEQ ID NO: 435 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1095 bases 
60. (B) TYPE: Nucleic acid 

263 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exophiala jeanselmei 

(B) STRAIN: ATCC 64755 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435 

CAAGCTGAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCTTGT 5 0 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CTGTCATCGA CGCCCCTGGT 100 

CATCGTGACT TTATCAAGAA CATGATCACT GGTACTTCCC AGGCTGACTG 150 

15 CGCCATTCTC ATCATTGCCG CCGGTACTGG TGAATTCGAA GCCGGTATCT 200 

CCAAGGATGG TCAGACTCGT GAGCACGCTC TGCTCGCCTA CACCCTGGGT 25 0 

GTCAAGCAGC TCATTGTCGC CATCAACAAG ATGGACACCA CCAAGTGGTC 3 00 

CGAGGATCGT TTCAACGAAA TCATCAAGGA GACTTCCAGC TTCATCAAGA 3 50 

AGGTCGGCTA CAACCCCAAG TCCGTTCCTT TCGTCCCCAT CTCCGGCTTC 400 

20 AACGGTGACA ACATGATCGA TGTCTCCACC AACTGCCCCT GGTACAAGGG 450 

CTGGGAGAAG GAGACCAAGG CTGGCAAGGC CTCTGGCAAG ACTCTCCTCG 500 

AGGCCATCGA CGCCATCGAC CCCCCCACTC GTCCCACCGA CAAGCCTCTC 550 

CGTCTTCCTC TCCAGGATGT CTACAAGATC TCTGGTATCG GAACGGTGCC 60 0 

CGTCGGTCGT GTTGAGACTG GTGTCATCAA GGCCGGTATG GTCGTTACCT 650 

25 TCGCTCCTGC CAACGTCACC ACTGAAGTCA AGTCCGTCGA AATGCACCAC 70 0 

GAACAACTCG CCGAGGGTGT TCCAGGTGAC AACGTTGGTT TCAACGTCAA 750 

GAACGTCTCC GTCAAGGAGG TTCGTCGTGG AAACGTCTGC GGTGACTCCA 800 

AGAACGACCC ACCCAAGGGT GCTGATTCCT TCAACGCCCA GGTCATCGTC 850 

TTGAACCACC CTGGTCAAGT CGGTGCTGGC TACGCCCCAG TGTTGGATTG 90 0 

3 0 CCACACTGCC CACATTGCTT GCAAGTTCTC TGAGCTTCTC GAGAAGATTG 950 

ACCGCCGTAC CGGTAAATCC ATCGAAAACA ACCCCAAGTT CATCAAGTCT 1000 
GGTGACGCTG CCATCGTCAA GATGGTTCCC AGCAAGCCCA TGTGTGTTGA 1050 
GGCCTTCACT GACTACCCAC CTCTTGGTCG TTTCGCCGTC CGTGA 1095 

35 

2) INFORMATION FOR SEQ ID NO: 436 

( i ) SEQUENCE CHARACTERISTICS : 

4 0 (A) LENGTH: 1113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436 



AAGCTCAAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

GAAGTTCGAG ACTCCTCGCT ACTATGTCAC CGTCATTGGT ATGTTGTCGC 100 

55 TCATGCTTCA TTCTACTTCT CTTCGTACTA ACACATCACT CAGACGCTCC 150 

CGGTCACCGT GATTTCATCA AGAACATGAT CACTGGTACT TCCCAGGCCG 20 0 

ATTGCGCCAT TCTCATCATT GCCGCCGGTA CTGGTGAGTT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTTCTTG CCTACACCCT 300 

TGGTGTCAAG AACCTCATCG TCGCCATCAA CAAGATGGAC ACCACCAAGT 350 

60 GGTCTGAGGC CCGTTACCAG GAGATCATCA AGGAGACCTC CTCTTTCATC 400 

264 
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AAGAAGGTCG GCTACAACCC CAAGGCTGTC 
TTTCAACGGT GACAACATGC TTACCCCCTC 
AGGGTTGGGA GCGTGAGATC AAGTCCGGCA 
CTCGAGGCCA TTGACTCCAT CGAGCCCCCC 
5 CCTTCGTCTT CCCCTTCAGG ATGTCTACAA 
TTCCCGTCGG CCGTATCGAG ACTGGTGTCA 
ACCTTCGCTC CTTCCAACGT CACCACTGAA 
CCACGAGCAA CTCACTGAGG GCCAGCCCGG 
TGAAGAACGT CTCCGTCAAG GACATCCGAC 

10 TCCAAGAACG ACCCCCCTAT GGGTGCCGCT 
CGTCCTCAAC CACCCCGGTC AGGTCGGTGC 
ATTGTCACAC TGCCCACATT GCCTGCAAGT 
ATCGACCGCC GAACCGGTAA GGCTACTGAG 
GTCTGGTGAC TCCGCCATCG TCAAGATGGT 

15 TTGAGGCTTT CAC 



PCT/CA00/01150_ 

GCTTTCGTCC CCATCTCCGG 4 50 

CACCAACTGC CCCTGGTACA 500 

AGCTCACTGG CAAGACCCTC 550 

AAGCGTCCCG TTGACAAGCC 6 00 

GATCGGTGGT ATTGGAACGG 650 

TCAAGCCCGG TATGGTCGTT 70 0 

GTCAAGTCCG TCGAGATGCA 750 

TGACAACGTT GGTTTCAACG 800 

GTGGTAACGT CGCTGGTGAC 850 

TCTTTCACCG CCCAGGTCAT 90 0 

TGGTTACGCT CCCGTCCTCG 950 

TCGCCGAGAT CCAGGAGAAG 1000 

GCCGCCCCCA AGTTCATCAA 1050 

TCCCTCCAAG CCCATGTGTG 1100 

1113 



2) INFORMATION FOR SEQ ID NO : 43 7 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 72 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Geotrichum spp. 

(B) STRAIN: LEV- 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437 

35 GGTCCGAGGA CAGATTCAAC GAGATTGTCA AGGAGACTTC CAACTTCATC 50 

AAGAAGGTTG GTTAYAACCC CAAGACTGTT GCTTTCGTCC CCATCTCTGG 100 

TTGGAACGGT GACAACATGA TTGAGCCCTC CACCAACTGC CCCTGGTACA 150 

AGGGATGGCA GAAGGAGACC AAGGCTGGTG TCACTAAGGG TAAGACCCTC 2 00 

CTTGAGGCCA TCGATGCCAT TGAGCCCCCT GTCAGACCTT CCGACAAGCC 2 50 

4 0 CCTCCGTCTT CCCCTCCAGG ATGTCTACAA GATCGGTGGT ATCGGAACTG 3 00 

TGCCCGTCGG CCGTGTCGAA ACCGGTGTCA TCAAGGCCGG TATGGTCGTC 3 50 

ACCTTCGCCC CCGCTGGTGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 4 00 

CCACGAGCTC CTCACTGAGG GTCTCCCCGG TGACAACGTT GGTTTCAACG 450 

TCAAGAACGT CTCCGTTAAG GATATCAGAC GTGGTAACGT CTGCGGTGAC 500 

45 TCCAAGAACG ATCCCCCCAA GGCTTGCGCT TCTTTCAACG CCCAGGTCAT 550 

TATCTTCAAC CACCCTGGTC AGATCTCTGC TGGATACTCT CCCGTCCTTG 600 

ATTGCCACAC CGCCCATATT GCTTGCAAGT TCGACACTTT GATCGAGAAG 650 

ATTGACCGTC GTACTGGTAA GAAGACTGAG GACTCCCCCA AGTTCGTCAA 700 

GGCCGGTGAT GCTGCTATCG TCAAGA 72 6 

50 



2) INFORMATION FOR SEQ ID NO: 43 8 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

265 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulation 
5 (B) STRAIN: G186A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438 

CACCACCAAG TGGTCCGAGT CCCGTTTCAA CGAAATCATC AAGGAGGTTT 50 

10 CCAACTTCAT CAAGAAGGTC GGATATAACC CCAAGGCTGT TCCCTTCGTG 100 

CCAATCTCTG GTTTCGAGGG TGACAACATG ATTGAACCCT CCCCCAACTG 150 

CACATGGTAC AAGGGCTGGA ACAAGGAGAC TGCCTCTGGC AAGTCTTCTG 2 00 

GTAAAACCCT TCTCGATGCC ATTGACGCCA TTGAACCCCC AACCCGTCCT 250 

ACCGATAAGC CCCTCCGTCT TCCCCTCCAG GATGTGTACA AAATCTCTGG 3 00 

15 TATTGGCACT GTTCCCGTCG GACGTGTTGA GACTGGTGTC ATCAAGCCCG 350 

GTATGGTCGT GACTTTCGCT CCCTCCAACG TCACCACTGA AGTCAAGTCC 4 00 

GTCGAAATGC ACCACCAACA ACTCCAGGCT GGTTACCCTG GCGACAACGT 4 50 

CGGCTTCAAC GTCAAGAACG TTTCAGTCAA GGAAGTCCGC CGTGGCAACG 500 

TTGCTGGCGA CTCCAAAAAT GATCCCCCTA AGGGCTGCGA ATCCTTCAAT 550 

2 0 GCCCAGGTCA TCGTCCTTAA CCACCCCGGC CAGGTTGGCG CTGGTTATGC 60 0 

CCCAGTCCTC GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC 650 

TTATTGAGAA GATCGACCGC CGTACCGGAA AGTCTGTTGA GAACAACCCC 700 

AAGTTCATCA AGTCTGGTGA TGCTGCTATC GTCAAGATGG TTCCCTCCAA 750 

GCCC 754 

25 



2 ) INFORMATION FOR SEQ ID NO : 43 9 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 743 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Issatchenkia orientalis 
40 (B) STRAIN: ATCC 6258 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439 

TGGGATGAAA ACAGATTTGA AGAAATTGTC AAGGAAACCC AAAACTTCAT 50 

45 CAAGAAGGTT GGTTACAACC CAAAGACTGT TCCATTCGTT CCAATCTCTG 100 

GTTGGAATGG TGACAACATG ATTGAAGCAT CCACCAACTG TCCATGGTAC 150 

AAGGGTTGGA CTAAGGAAAC CAAGGCAGGT GTTGTTAAGG GTAAGACCTT 2 00 

ATTAGAAGCA ATCGATGCTA TTGAACCACC TGTCAGACCA ACCGAAAAGC 2 50 

CATTAAGATT ACCATTACAA GATGTTTACA AGATTGGTGG TATTGGTACT 3 00 

50 GTGCCAGTCG GTAGAGTCGA AACCGGTGTC ATTAAGCCAG GTATGGTTGT 3 50 

CACTTTTGCT CCAGCAGGTG TCACCACCGA AGTCAARTCC GTTGAAATGC 4 00 

ACCATGAACA ATTAGAACAA GGTGTTCCAG GTGATAACGT TGGTTTCAAC 4 50 

GTTAAGAACG TCTCTGTCAA GGATATCAAG AGAGGTAACG TTTGTGGTGA 500 

CTCCAAGAAC GACCCACCAA TGGGTGCAGC TTCYTTCAAT GCTCAAGTCA 550 

55 TTGTCTTGAA CCACCCTGGT CAAATTTCCG CTGGTTACTC TCCAGTCTTG 600 

GATTGTCACA CTGCCCACAT TGCATGTAAG TTCGACGAAT TAATCGAAAA 650 

GATTGACAGA AGAACTGGTA AGTCTGTTGA AGACCATCCA AAGTCYGTCA 700 

AGTCTGGTGA TGCAGCTATC GTCAAGATGG TCCCAACCAA GCC 743 

60 
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2) INFORMATION FOR SEQ ID NO: 440 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia furfur 

(B) STRAIN: ATCC 42132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440 

CAAGCTCAAG GCTGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCTCTGT 50 

GGAAGTTCGA GACCCCTAAG TACCACGTTA CCGTCATTGA CGCTCCTGGT 10 0 

CACCGTGACT TCATCAAGAA CATGATTACG GGTACCTCGC AGGCTGACTG 150 

CGCTATCCTC ATCATTGCCG GTGGTACCGG TGAGTTCGAG GCTGGTATCT 2 00 

CGAAGGACGG TCAGACCCGT GAGCACGCTC TGCTCGCTTT CACCCTGGGT 250 

GTGCGTCAGC TCATTGTGGC CGTCAACAAG ATGGACACCA CCAAGTACTC 3 00 

GGAGGACCGC TTCAACGAGA TTGTCCGCGA AGTGTCGAAC TTCATCAAGA 3 50 

AGGTCGGTTT CAACCCCAAG ACTGTTGCCT TCGTCCCCAT CTCGGGCTGG 4 00 

CACGGTGACA ACATGATCGA GGCCACCACC AACATGCCTT GGTACAAGGG 4 50 

CTGGGAGAAG GAGACCAAGT CGGGCAAGGT CACTGGTAAG ACTCTGCTGG 50 0 

ACGCCATCGA CGCCATCGAG CCCCCGACCC GCCCCACTGA CAAGCCCCTG 550 

CGTCTCCCTC TGCAGGATGT GTACAAGATC GGTGGTATCG GTACTGTCCC 6 00 

TGTCGGTCGT GTTGAGACCG GTGTGATCAA GCCCGGTATG GTTGTGACCT 6 50 

TCGCTCCCTC GAACGTCACC ACTGAAGTTA AGTCGGTTGA GATGCACCAC 7 00 

GAGTCGCTCC CTGAGGGTCT CCCCGGTGAC AACGTTGGTT TCAACGTGAA 750 

GAACGTCTCG GTTAAGGACA TTCGCCGTGG TAACGTTGCC TCGGACTCGA 800 

AGAACGACCC CGCTCAGGAG GCTGCTTCGT TCAACGCGCA GGTCATTGTC 850 

ATGAACCACC CTGGTCAGAT CAGCAACGGT TACTCGCCCG TGCTTGACTG 900 

CCACACTGCG CACATTGCCT GCCGCTTCAA CAACATCCTC CAGAAGATCG 950 

ACCGTCGCTC GGGTAAGGTG CTTGAGGAGA ACCCCAAGTT CATCAAGTCG 10 00 

GGTGACGCTG CCATGGTGGA GATGATCCCC ACCAAGCCCA TGTGTGTGGA 1050 

GTCGTTCAAC GAGTACCCCC CTCTGGGTCG TTTCGCTGTG C 10 91 



2) INFORMATION FOR SEQ ID NO: 441 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 74 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydermatis 

(B) STRAIN: ATCC 42756 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441 

ACCACCAAGT ACTCGGAGGA CCGCTTCAAC GAGATTATTC GCGAGACTTC 50 
GAACTTCATC AAGAAGGTCG GTTACAACCC GAAGACTGTT GCCTTCGTCC 100 
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CGATCTCGGG CTGGCACGGT GACAACATGA TTGAGGCGAC CACCAACATG 150 

CCGTGGTACA AGGGCTGGGA GAAGGAGACC AAGTCGGGCA AGGCCACTGG 2 00 

TAAGACCCTT CTGGACGCTA TTGACGCCAT TGAGCCGCCG ACGCGCCCGA 2 50 

CCGACAAGCC TCTCCGTCTT CCTCTCCAGG ATGTGTACAA GATCGGTGGT 3 00 

5 ATCGGTACYG TCCCGGTCGG CCGTGTTGAG ACCGGTGTTA TCAAGCCCGG 3 50 

TATGGTTGTG ACCTTCGCTC CGTCGAACGT CACSACTGAA GTTAAGTCGG 4 00 

TCGAGATGCA CCACGAGCAG ATCCCTGAGG GTCTTCCGGG TGACAACGTT 4 50 

GGTTTCAACG TGAAGAACGT GTCGGTCAAG GACATTCGCC GTGGTAACGT 500 

CGCCTCGGAC TCGAAGAACG ACCCGGCTCA GGAGGCTGCC TCGTTCAATG 550 

10 CTCAGGTCAT TGTGATGAAC CACCCTGGTC AGATCAGCAA CGGTTACTCG 600 

CCRGTGCTCG ACTGCCACAC TGCTCACATT GCCTGCCGCT TCAACAACAT 650 

CCTCCAGAAG ATCGACCGTC GTTCGGGTAA GGTTCTYGAA GAGAACCCCA 700 

AGTTCATCAA GTCGGGTGAC GCTGCCATGG TTGAGATGAT CCCGACCAA 74 9 



15 



30 



2) INFORMATION FOR SEQ ID NO: 442 



{ i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 1150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malbranchea filamentosa 

(B) STRAIN: ATCC 48174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 442 



ACTGAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC CATGTCACCG TCATTGGTAC GTTCGACATG 100 

35 TTCGACCTTT TGCCTAGTGT CCCCTTCTAA CCACAGTTTA TAGACGCCCC 150 

TGGCCATCGT GATTTCGTCA AGAACATGAT CACTGGTACT TCCCAGGCTG 2 00 

ATTGCGCTAT CCTCATCATT GCTTCCGGCA CTGGTGAATT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTGCTTT CCTTCACCCT 3 00 

CGGTGTTAGG CAGCTCATTG TCGCCCTCAA CAAGATGGAC ACTGTCAACT 3 50 

4 0 TCGCTGAGGC CCGTTACAAC GAGATTGTCA AGGAAGTCTC CAACTTCATC 4 00 

AAGAAGGTCG GCTACAACCC CAAGGCCGTT CCTTTCGTCC CCATCTCCGG 4 50 

TTTCGAGGGT GACAACATGA TCGAGGCCTC CACCCGCATT CCTTGGTACA 500 

AGGGCTGGAA CAAGGAGACC GCCAGTGGCA AGAGCACCGG CAAGACCCTY 550 

CTCGAGGCCA TTGATGCCAT CGAACCCCCG GTCCGTCCCA CCGACAAGCC 600 

4 5 CCTYCGTCTC CCTCTTCAGG ATGTGTACAA GATCTCCGGT ATTGGCACTG 650 

TTCCTGTCGG TCGTGTTGAG ACTGGTGTCA TCAAGCCTGG TATGGTCGTT 7 00 

ACTTTCGCCC CCGCCAACGT CACCACTGAA GTCAAGTCCG TCGAGATGCA 750 

CCACCAGCAG CTCCAGGCCG GTAACCCCGG TGACAACGTC GGCTTCAACG 8 00 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGCAACGT TGCCTCCGAC 850 

50 TCGAAGAACG ACCCCGCCAA GGGCTGCGAC TCCTTCAACG CCCAGGTCAT 90 0 

CGTCCTTAAC CACCCCGGTC AGGTCGGTGC TGGATACGCT CCCGTCCTCG 950 

ATTGCCACAC TGCCCACATT GCTTGCAAGT TCTCTGAGCT TCTTGAGAAG 1000 

ATCGATCGCC GTACCGGTAA ATCCGTTGAG GACCACCCCA AGTTCATCAA 1050 

GTCTGGTGAT GCCGCTATCG TCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 

55 TTGAGGCTTT CACTGACTAC CCTCCCCTTG GTCGTTTCGC CGTCCGTGAC 1150 



60 



2) INFORMATION FOR SEQ ID NO: 443 
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( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1099 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Metschnikowia pulcherrima. 

(B) STRAIN: DSM 70336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443 

15 GGACAAGTTG AAGGCTGAGA GAGAGAGAGG TATCACCATC GACATTGCCT 50 

TGTGGAAGTT CGAGACTCCT AAGTACCACG TCACCGTYAT TGACGCCCCA 100 

GGTCACAGAG ATTTCATCAA GAACATGATC ACTGGTACTT CCCAGGCTGA 150 

CTGTGCYATC TTGATTATCG CYGGTGGTGT TGGTGAGTTC GAGGCTGGTA 2 00 

TCTCCAAGGA TGGCCAGACC AGAGAGCACG CTTTGTTGGC YTACACCTTG 2 50 

2 0 GGTGTTAGAC ARTTGATTGT TGCCGTCAAC AAGATGGACT CTGTCAAGTG 3 00 

GGACAAGAAC AGATTTGAGG AGATCATCAA GGAGACCTCT AACTTCGTCA 350 

AGAAGGTTGG TTACAACCCT AAGACYGTGC CATTCGTGCC AATYTCTGGT 4 00 

TGGAACGGTG ACAACATGAT TGAGGCYTCC ACTAACTGCC CATGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGGCCGGTAA GTCTWCCGGT AAGACCTTGT 50 0 

25 TGGAGGCCAT TGACGCCATT GAGCCACCAA CCAGACCTAC CGACAAGGCC 550 

TTGAGATTGC CTTTGCAGGA TGTCTACAAG ATCGGTGGTA TCGGAACGGT 600 

GCCAGTCGGC CGTGTCGAGA CCGGTGTCAT TAAGGCCGGT ATGGTTGTYA 650 

CCTTYGCCCC AGCTGGTGTC ACCACTGAGG TCAAGTCCGT CGAGATGCAC 7 00 

CACGAGCAGT TGGTCGAGGG TCTTCCAGGT GACAAYGTTG GTTTCAACGT 750 

3 0 CAAGAACGTC TCCGTTAAGG AGATCAGAAG AGGTAACGTC TGTGGTGACT 800 

CCAAGCAGGA CCCACCAAAG GGTGCCGCTT CTTTCACCGC YCAGGTTATT 850 

GTGTTGAACC ACCCTGGTCA GATCTCCTCT GGTTACTCTC CAGTGTTGGA 900 

CTGYCACACC GCCCACATTG CCTGTAARTT CGACACCTTG TTGGAGAAGA 950 

TTGACAGAAG AACTGGTAAG TCCTTGGAGT CYGAGCCTAA GTTCGTCAAG 1000 

3 5 TCYGGTGACG CCGCCATTGT CAAGATGGTG CCAACCAAGC CAATGTGTGT 1050 

TGAGGCTTTC ACCGACTACC CACCTTTGGG TAGATTCGCC GTCAGAGAC 1099 



40 2) INFORMATION FOR SEQ ID NO: 444 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1153 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 42570 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444 



CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAC ATTGCCCTCT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CCGTCATTGG TACGTCGACT 100 

CGCGCGAGAC TGGTCGCAAT TTCCACGTCG CTAACGTGCT TGAACAGACG 150 

CTCCCGGCCA CCGTGACTTC ATCAAGAACA TGATCACTGG TACCTCCCAG 2 00 

60 GCTGACTGCG CTATCCTCAT TATCGCTGCC GGCACTGGTG AGTTCGAGGC 250 

269 
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TGGTATCTCC AAGGATGGCC 
CCCTCGGTGT TAAGCAGCTC 
AAGTGGTCTG AGGCCCGTTT 
CATCAAGAAG GTCGGCTACA 
5 CTGGTTTCCA CGGCGACAAC 
TACAAGGGCT GGGAGAAGGA 
CCTCCTTGAG GCCATCGACT 
AGCCCCTCCG CCTTCCCCTT 
ACAGTCCCTG TCGGCCGTAT 

10 CGTGACCTTC GCTCCTTCCA 
TGCACCACGA GCAGCTCTCC 
AACGTCAAGA ACGTCTCCGT 
TGACTCCAAG AACGACCCCC 
TCATCGTCCT CAACCACCCC 

15 CTCGACTGCC ACACCGCCCA 
GAAGATCGAC CGCCGTACCG 
TCAAGTCTGG CGACTCTGCC 
TGCGTTGAGG CTTTCACCGA 
TGA 

20 



AGACCCGTGA GCACGCTCTG 
ATCGTCGCTA TCAACAAGAT 
CCAGGAGATC ATCAAGGAGA 
ACCCCAAGAC CGTCGCTTTC 
ATGCTTTCCC CCTCCACCAA 
GACCAAGGCT GGCAAGTCCA 
CCATCGAGCC CCCCAAGCGC 
CAGGATGTGT ACAAGATCGG 
CGAGACTGGT GTCATCAAGC 
ACGTCACCAC CGAAGTCAAG 
GAGGGTGTCC CCGGTGACAA 
CAAGGAGATC CGTCGTGGCA 
CTCTGGGTGC CGCTTCTTTC 
GGCCAGGTCG GTGCTGGCTA 
CATTGCCTGC AAGTTCGCCG 
GCAAGTCTGT CGAGTCCGCC 
ATCGTCAAGA TGATTCCCTC 
CTACCCTCCT CTGGGCCGCT 
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CTCGCCTKCA" 


3(KT 


GGACACCACC 


350 


CCTCCAACTT 


400 


GTCCCCATCT 


450 


CTGCCCCTGG 


500 


CCGGCAAGAC 


550 


CCCAGCGACA 


600 


CGGTATCGGC 


650 


CCGGCATGGT 


700 


TCCGTTGAGA 


750 


CGTCGGCTTC 


800 


ACGTCGCCGG 


850 


GATGCCCAGG 


900 


CGCCCCCGTC 


950 


AGATCAAGGA 


1000 


CCCAAGTTCA 


1050 


CAAGCCCATG 


1100 


TCGCCGTCCG 


1150 




1153 



2) INFORMATION FOR SEQ ID NO: 445 

25 ( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 763 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 
35 (B) STRAIN: ATCC 32071 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 445 

TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

40 CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 2 00 

GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 250 

ACCGATAAAC CTCTCCGTCT TCCCCTCCAG GATGTCTACA AGATCTCCGG 300 

45 TATTGGCACT GTTCCTGTCG GACGTGTTGA GACTGGAGTC ATCAAGCCCG 350 

GTATGGTCGT GACCTTCGCT CCCGCCAACG TCACCACTGA AGTCAAGTCC 4 00 

GTCGAAATGC ACCACCAGCA GCTTACCGCC GGTAACCCCG GTGACAACGT 450 

CGGCTTCAAC GTCAAGAATG TTTCCGTCAA AGAAGTCCGC CGTGGTAACG 500 

TTGCCGGTGA CTCTAAGAAT GATCCCCCAA AGGGCTGCGA TTCCTTCAAT 550 

50 GCCCAGGTCA TCGTCCTCAA CCACCCTGGT CAGGTTGGCG CTGGTTATGC 600 

CCCAGTCCTC GACTGCCATA CTGCCCACAT TGCCTGCAAA TTCGCTGAGC 650 

TCCTTGAGAA GATTGATCGA CGAACCGGAA AGTCTGTTGA GAACAACCCC 70 0 

AAGTTCATCA AGTCCGGTGA TGCTGCTATC GTCAAGATGA TTCCTTCCAA 750 

GCCCATGTGC GTC 763 

55 

2) INFORMATION FOR SEQ ID NO: 446 
60 ( i ) SEQUENCE CHARACTERI STI CS : 
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(A) LENGTH: 1346 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium mameffei 
10 (B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446 

AAGCTCAAGG CTGAGCGTGA GCGTGGTATC ACCATCGATA TTGCTCTCTG 50 

15 GAAGTTCCAG ACTGCCAAGT ACGAGGTTAC CGTCATTGAC GCCCCCGGTC 10 0 

ACCGTGATTT CATCAAGAAC ATGATCACTG GTACCTCCCA GGCCGATTGC 150 

GCTATTCTCA TCATTGCCTC TGGTACTGGT GAATTCGAGG CTGGTATCTC 2 00 

CAAGGATGGC CAGACTCGTG AGCACGCTCT TTTGGCTTTC ACCCTCGGTG 2 50 

TCCGTCAGCT CATTGTTGCC CTCAACAAGA TGGACACTTG CAAGTGGTCT 3 00 

2 0 CAGGGTGAGT ACTCGTACCT GCGTTTGGCC TTGAATATCT TACTAATGCA 3 50 

CCATAGATCG TTACAACGAA ATTGTCAAGG AGACTTCCAA CTTCATCAAG 4 00 

AAGGTCGGAT ACAACCCCAA GAACGTTCCT TTCGTTCCTA TCTCCGGTTT 4 50 

CAACGGTGAC AACATGCTTG AGCCCTCCCC CAACTGCCCC TGGTACAAGG 50 0 

GTTGGGAGAA GGAGACCAAG GCCGGTAAGG TCACTGGTAA GACCCTCCTC 5 50 

25 GAGGCCATCG ACGCCATTGA GCCCCCTACC CGTCCCGCCA ACAAGGTTAG 600 

TCCCTCCTCG ACTACTCAAA CCCTCCTCAT AAGTTCAGAT TACTGACTCG 650 

TTCACAGCCC CTCCGTCTTC CCCTCCAGRA TGTCTACAAG ATCGGTGGTA 700 

TTGGAACGGT TCCCGTCGGT CGTGTTGAGA CTGGTACCAT CGTTCCTGGT 7 50 

ATGGTTGTCA CCTTGTAAGT CACTCTCCTC GCTTATCCTA CCTGAAATCA 8 00 

3 0 TCATGTGCTA ACTTGACACT CAGCGCTCCC GCCAACGTCA CCACTGAAGT 850 

CAAGAGTGTT GAAATGCACC ACCAGCAGCT CACTGCCGGT CAGCCCGGTG 9 00 

ACAACGTTGG TTTCAACGTG AAGAACGTCT CCGTCAAGGA AATCCGTCGT 950 

GGTAACGTTG CTGGTGACAG CAAGAACGAC CCCCCTGCCG GTGCTGCCTC 100 0 

CTTCAACGCC CAGGTCATCG TCCTCAACCA CCCCGGTCAG GTCGGTGCTG 1050 

3 5 GTTACGCCCC AGTCCTCGAT TGCCACACTG CCCACATTGC TTGCAAGTTC 1100 

GCTGAGCTCC TCGAGAAGAT TGACCGTCGT ACCGGAAAGT CTGTTGAGGA 1150 

CCACCCCAAG TTCATCAAGT CCGGTGACGC TGCCATCGTC AAGATGATTC 12 00 

CTTCCAAGCC CATGTGTGTT GAGGCTTTCA CCGAGTACCC TCCTCTCGGT 12 50 

CGTTTCGCCG TTCGCGAGTA AGTTTTATCT CCGTTGTCTA TTTTCCATCC 13 00 

4 0 TTCCCTTCTC CTCCGTCTTC CATATATACT TTTTCAGTTA TATGTG 1346 



2) INFORMATION FOR SEQ ID NO: 447 



45 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1094 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Pichia anowala 

(B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447 

60 AAGTTAAAAG CTGAACGTGA AAGAGGTATC ACTATTGATA TTGCTTTATG 50 
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GAAATTCGAA 
ACAGAGATTT 
GCTATYTTAA 
AAAAGATGGT 
5 TTAAACAATT 
GAATCTCGTT 
AGTTGGTTAC 
ATGGTGATAA 
TGGAAAAAAG 

10 AGCCATTGAT 
GTTTACCATT 
GTCGGTAGAG 
TGCCCCAGCT 
AACAATTGAC 

15 AATGTTTCTG 
AAACGATCCA 
TAAACCATCC 
CACACTGCTC 
CAGACGTACT 

2 0 GTGATGCTGC 
GCTTTCACTG 



ACTCCAAAAT 
CATCAAAAAT 
TTATTGCCGG 
CAAACCAGAG 
GATTGTTGCT 
TCGAAGAAAT 
AACCCAAAAA 
CATGATTGAA 
AAACCAAAGC 
GCTATTGACC 
ACAAGATGTT 
TTGAAACCGG 
GGTGTTACCA 
TGAAGGTTTA 
TTAAAGAAAT 
CCAAAAGCTG 
AGGTCAAATC 
ACATTGCTTG 
GGTAAGAAAT 
TATTGTTAAA 
ACTACCCACC 



ACCATGTTAC 
ATGATTACTG 
TGGTATTGGT 
AACACGCTTT 
ATCAACAAGA 
TGTCAAGGAA 
CTGTTCCATT 
CCATCAWCTA 
TGGTGAAGCT 
CACCATCAAG 
TACAARATTG 
TGTTATCAAA 
CTGAAGTCAA 
CCAGGTGACA 
CCGTCGTGGT 
CTGAATCATT 
TCTGCTGGTT 
TAAATTTGAC 
TAGAAGAAGC 
TTTGTTCCAT 
ATTAGGTCGT 



CGTTATTGAT 
GTACTTCCCA 
GAATTCGAAG 
NTTAGCTTAC 
TGGATTCCGT 
ACYTCAAACT 
CGTTCCAATC 
ACTGTCCATG 
AAAGGTAAAA 
ACCAACTGAT 
GTGGTATTGG 
CCAGGTATGG 
ATCTGTTGAA 
ATGTTGGTTT 
AACGTCTGTG 
CAATGCTCAA 
ACTCTCCAGT 
ACTTTAATTG 
TCCAAAATTC 
CAAAACCATT 
TTCGCTGTCA 



PCT/CA00/01150 

GCTCCAGGTC 100 

AGCTGATTGT 150 

CTGGTATCTC 2 00 

ACCTTAGGTG 250 

TAAATGGGAT 3 00 

TTATCAAGAA 350 

TCAGGTTGGA 4 00 

GTACAAAGGT 4 50 

CTTTATTAGA 500 

AAACCATTAC 550 

TACTGTGCCA 60 0 

TTGTTACCTT 650 

ATGCATCATG 700 

CAACGTTAAG 750 

GTGACTCTAA 800 

GTTATTGTCT 850 

TTTAGATTGT 900 

AAAAAATTGA 950 

ATCAAATCAG 1000 

ATCAGTTGAA 1050 

GAGA 1094 



25 2) INFORMATION FOR SEQ ID NO: 448 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1100 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 2149 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448 



CTTAGATAAG TTAAAAGCTG AACGTGAAAG AGGTATCACT ATTGATATTG 50 

CTTTATGGAA ATTCGAAACT CCAAAATACC ATGTTACCGT TATTGATGCT 100 

CCAGGTGACA GAGATTTCAT CAAAAATATG ATTACTGGTA CTTCCCAAGC 150 

TGATTGTGCT ATMTTAATTA TTGCCGGTGG TATTGGTGAA TTCGAAGCTG 2 00 

45 GTATCTCAAA AGATGGTCAA ACCAGAGAAC ACGCTTTRTT AGCTTACACC 2 50 

TTAGGTGTTA AACAATTGAT TGTTGCTATC AACAAGATGG ATTCCGTTAA 3 00 

ATGGGATGAA TCTCGTTTCG AAGAAATTGT CAAGGAAACY TCAAACTTTA 350 

TCAAGAAAGT TGGTACAACC CAAAAACTGT TCCATTCGTT CCAATCTCAG 4 00 

GTTGGAATGG TGATAACATG ATTGAACCAT CAACTAACTG TCCATGGTAC 450 

50 AAAGGTTGGA AAAAAGAAAC CAAAGCTGGT GAAGCTAAAG GTAAAACTTT 500 

ATTAGAAGCC ATTGATGCTA TTGATCCACC ATCAAGACCA ACTGATAAAC 550 

CATTACGTTT ACCATTACAA GATGTTTACA ARATTGGTGG TATTGGTACT 600 

GTGCCAGTCG GTAGAGTTGA AACCGGTGTT ATCAAACCAG GTATGGTTGT 650 

TACCTTTGCC CCAGCTGGTG TTACCACTGA AGTCAAATCT GTTGAAATGC 700 

55 ATCATGAACA ATTGACTGAA GGTTTACCAG GTGACAATGT TGGTTTCAAC 750 

GTTAAGAATG TTTCTGTTAA AGAAATCCGT CGTGGTAACG TCTGTGGTGA 800 

CTCTAAAAAC GATCCACCAA AAGCTGCTGA ATCATTCAAT GCTCAAGTTA 850 

TTGTCTTAAA CCATCCAGGT CAAATCTCTG CTGGTTACTC TCCAGTTTTA 900 

GATTGTCACA CTGCTCACAT TGCTTGTAAA TTTGACACTT TAATTGAAAA 950 

60 AATTGACAGA CGTACTGGTA AGAAATTAGA AGAAGCTCCA AAATTCATCA 1000 

272 
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AATCAGGTGA TGCTGCTATT GTTAAATTTG TTCCATCAAA ACCATTATCA * 1050 
GTTGAAGCTT TCACTGACTA CCCACCATTA GGTCGTTTCG CTGTCAGAGA 110 0 



2) INFORMATION FOR SEQ ID NO: 449 

(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 01 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudallescheria boydii 

(B) STRAIN: ATCC 44331 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449 

CAAGCTCAAG GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT 50 

GGAAGTTCGA GACCCCCAAG TACCAGGTCA CCGTCATTGG TATGTCTTTG 10 0 

TGCTTTTTGT GCTTTTTGTG CCTGTGCCTC GCACAATTCC AGCCCTCGAT 150 

AATTATGAAC CTCGTACTAA TATGTCGTTC TCCCACTACC CACAGACGCC 2 00 

CCCGGCCATC GTGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC 250 

TGATTGCGCC ATTCTCATCA TTGCCTCCGG TACTGGTGAG TTCGAGGCTG 3 00 

GCATCTCCAA GGATGGCCAG ACCCGTGAGC ACGCTCTTCT CGCCTTCACC 3 50 

CTCGGTGTCA AGAACCTCAT TGTTGCCATC AACAAGATGG ACACCAACAA 4 00 

CTGGTCCGAG GACCGATACA AGGAGATCAT CAAGGAGACC TCCAACTTCA 4 50 

TCAAGAAGGT CGGCTACAAC CCCAAGGCCG TTCCTTTCGT CCCCATCTCC 500 

GGTTTCCACG GAGACAACAT GCTTACCCCC TCCACCAACT GCCCCTGGTA 550 

CAAGGGTTGG GTCCGTGAGG TCAAGGGTAA CACCCTTACC GGCAAGACCC 600 

TTCTCGAGGC CATCGACTYC ATCGAGCCCC CCAAGCGTCC CACCGAGAAG 650 

CCCCTCCGTC TTCCCCTTCA GGACGTCTAC AAGATCGGTG GTATTGGCAC 70 0 

TGTGCCCGTC GGCCGTATCG AGACCGGTAT CCTCAAGCCC GGTATGGTCG 750 

TCACCTTCGC TCCCTCCAAC GTCACCACTG AAGTCAAGTC CGTCGAGATG 80 0 

CACCACGAGC AGCTTACCGA GGGTGTCCCC GGTGACAACG TTGGTTTCAA 850 

CGTGAAGAAC GTCTCCGTCA AGGATATCCG CCGTGGCAAC GTCTGCGGTG 900 

ACTCCAAGAA CGACCCCCCC GCTGCTGCCG CCTCTTTCCA GGCCCAGGTC 950 

ATTGTCCTCA ACCACCCCGG CCAGATCGGT GCTGGTTACG CTCCCGTTCT 1000 

TGACTGCCAC ACTGCCCACA TTGCTTGCAA GTTCGCCGAG CTCCTTGAGA 1050 

AGATCGACCG CCGTACCGGT AAGTCGGTCG AGAACAACCC CAAGTTCGTC 1100 

AAGTCTGGTG ATGCCGCCAT CGTCAAGATG GTTCCCTCCA AGCCCATGTG 1150 

TGTTGAGTCC TTCACCGAGT ACCCCCCTCT CGGTCGTTTC GCCGTCCGTG 12 00 

A 1201 



2) INFORMATION FOR SEQ ID NO: 450 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 95 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) 
(B) 



PCT/CAOO/01150 



ORGANISM: Rhizopus oryzae 
STRAIN: ATCC 56015 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450 



CAAGCTTAAG 
GGAAGTTCGA 
CACCGTGATT 
TGCTATTCTT 

10. CCAAGGATGG 
GTCCGTCAAT 
CGAAGCTCGT 
AGATTGGTTA 
CACGGTGACA 

15 ATGGAACAAG 
ATGCCATTGA 
CGTCTTCCTC 
CGTCGGTCGT 
TCGCTCCTGC 

20 GAAACCCTCA 
GAACGTCTCC 
AGAACGACCC 
TTGAACCACC 
TCACACTGCT 

2 5 ACAGACGTTC 
GGTGACTCTG 
AGCTTACACT 



GCTGAACGTG 
AACCCCCAAG 
TCATCAAGAA 
ATCATTGCTG 
TCAAACCCGT 
TGATTGTTGC 
TTCAACGAAA 
CAACCCCAAG 
ACATGTTGGA 
GAAACCAAGG 
CAACATTGAC 
TTCAAGATGT 
GTCGAAACTG 
TGCTGTCACC 
CTGAAGGTCT 
GTCAAGGATA 
CGCCAAGGAA 
CTGGTCAAAT 
CACATTGCCT 
CGGTAAGTCC 
CCATCGTCAA 
GACTACCCTC 



AACGTGGTAT 
TACCAAATTA 
CATGATTACC 
GTGGTACTGG 
GAACACGCCC 
TGTCAACAAG 
TYGTCAAGGA 
TCTGTTCCCT 
AGAATCTACC 
CTGGTGCCAA 
CCTCCTACCC 
TTACAAGATT 
GTGTCATCAA 
ACTGAAGTTA 
CCCCGGTGAC 
TCCGTCGTGG 
GCCGGTTCCT 
TGGTGCTGGT 
GTAAGTTCGC 
TTGGAAGCTA 
GATGATCCCC 
CTCTCGGTCG 



CACCATCGAT 
CCGTTATTGA 
GGTACTTCTC 
TGAATTCGAA 
TTTTGGCTTT 
ATGGATACCA 
AGTTTCTTCC 
TCGTCCCCAT 
AACATGCCCT 
GTCTGGTAAG 
GTCCTGTTGA 
GGTGGTATCG 
GGCTGGTATG 
AGTCCGTCGA 
AACGTCGGTT 
TAACGTCTGT 
TCACCGCTCA 
TACGCTCCYG 
TGAATTGATC 
CTCCCAAGTT 
TCCAAGCCCA 
TTTCGCTGTT 



ATTGCTCTCT 50 

TGCTCCCGGT 100 

AAGCCGATTG 150 

GCTGGTATCT 200 

CACTCTCGGT 250 

CCAAGTGGTC 3 00 

TTCATCAAGA 3 50 

CTCTGGTTGG 4 00 

GGTACAAGGG 4 50 

ACCCTCTTGG 500 

CAAGCCTCTC 550 

GTACTGTCCC 600 

GTTGTCACCT 650 

AATGCACCAC 700 

TCAACGTCAA 750 

TCTGACTCCA 800 

AGTCATTATC 850 

TTTTGGATTG 900 

GAAAAGATTG 950 

CGTCAAGTCT 1000 

TGTGTGTTGA 1050 

CGTGA 10 95 



30 



35 



40 



45 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 451 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 92 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451 



GCTGAAGGCC 
AGTTCGAGAC 
CGTGATTTCA 
CATCAGCTAA 
GGCCGATTGC 
CTGGTATCTC 
ACCCTCGGTG 
CAAGGTAGGC 
CTGTCTTGTC 
GTCAAGGAGA 
TGTTCCCTTC 
AGTCCACCAA 
GGCGAGTACA 
CCCCACCCGT 



GAGCGAGAGC 
CCCCAAGTAC 
TCAAGAGTGA 
TGCATGTTAT 
GCTATTCTCA 
CAAGGATGGC 
TCAGACAGCT 
TAACTTCACA 
TTCCACCCTC 
CATCCGGTTT 
GTCCCCATCT 
CATGCCTTGG 
AGGGAAAGAC 
CCTACCGACA 



GTGGTATCAC 
AACGTCACCG 
GTTAACCATA 
GCGTCCAGAC 
TCATCGCCAC 
CAGACCCGAG 
CATCGTTGCC 
ACGTCGGCTT 
CAGTTCTCCG 
CATCAAGAAG 
CAGGATGGCA 
TACAAGGGAT 
CCTGCTCGAG 
AGCCTCTCCG 
2 



CATCGATATC 
TCATTGACGC 
ACATCAAACA 
ATGATTACTG 
CGGTGTTGGT 
AGCACGCCCT 
TTGAACAAGA 
CCCATCATTC 
AGTCCCGATA 
GTCGGATTCG 
CGGAGACAAC 
GGAAGAAGAC 
GCCATCGACT 
ACTTCCCCTC 
74 



GCTCTATGGA 
TCCAGGACAT 
GTGTTGCAAA 
GTACTTCCCA 
GAGTTCGAGG 
TCTCGCCTTC 
TGGACTCGGT 
ATTCACTTAC 
CGATGAAATC 
ACCCCAAGGG 
ATGATCGAGG 
CACCAAGACC 
CCATCGAGCC 
CAGGATGTCT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
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ACAAGATTGG TGGTATCGGA ACAGTGCCAG TCGGACGAGT CGAGACTGGT 750 

ACCATCAAGG CTGGTATGGT CGTCACCTTC GCTCCTTCAG CTGTCACCAC 800 

CGAAGTCAAG TCTGTTGAGA TGCACCACGA GCAGCTCGAG GCTGGTCTTC 850 

CAGGTGACAA CGTCGGATTC AACATCAAGA ACGTTTCAGT CAAGGATATC 90 0 

5 CGAAGAGGAA ACGTCTGCGG TGACTCCAAG AACGATCCCC CCAAGGAGGC 95 0 

TGCTTCCTTC ACCGCCCAGG TCATTGTCCT CAACCACCCC GGTCAAATCG 100 0 

GTAACGGATA CTCTCCAGTT CTCGATTGCC ACACTGGTGA GTCATTCTTC 1050 

CATATTAGTT TGAACTCTTT TGAACAATAC TAACGTGAAT CATTATACTT 1100 

TTCAGCCCAC ATTGCATGCA AGTTCGACAC CCTCCTAGAG AAGATTGACC 1150 

10 GACGATCCGG AAAGTCCATC GAAGATACCC CCAAGTTCGT CAAGTCTGGT 12 0 0 

GACGCCGCCA TCGTCAAGAT GGTCCCCACC AAGCCAATGT GCGTTGAGGC 12 50 

TTTCACCGAC TACCCACCTC TTGGACGATT CGCCGTCCGT GA 12 92 

15 

2) INFORMATION FOR SEQ ID NO: 452 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 8 9 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporobolomyces salmonicolor 

(B) STRAIN: ATCC 32311 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452 

AAGCTCAAGG CCGAGCGTGA GCGTGGTATC ACCATCGATA TCGCTCTCTG 50 

GAAGTTCGAG ACCCCCAAGG TGCGTTCTCA CCCCGGCTGA GGAGCACGCA 10 0 

CGCGAGGGCT CACGCTGCGC CTCTTACAGT ACATGATCAC CGTCATCGAC 150 

35 GCCCCGGGTC ACCGTGACTT CATCAAGAAC ATGATCACCG GTACCTCGCA 2 00 

GGCCGACTGC GCCATCCTCA TCATCGCCGC CGGTACCGGT GAGTTCGAGG 2 50 

CTGGTATCTC GAAGGACGGC CAGACCCGCG AGCACGCCCT CCTCGCCTTC 3 00 

AC C CTCGGTG TCCGTCAGCT CATCGTCGCC ATCAACAAGA TGGACACGAC 350 

CAAGTACTCG GAGGCCCGTT TCGAGGAGAT CATCAAGGAG ACCTCCAACT 4 00 

4 0 TCATCAAGAA GGTCGGCTTC AACCCCAAGA ACGTCCCCTT CGTCCCCATC 4 50 

TCGGGATGGC ACGGTGACAA CATGATTGAG GAGACCGCCA ACATGCCCTG 500 

GTACAAGGGA TGGAAGAAGG AGACCAAGGC CGGTGAGGTC AAGGGCAAGA 550 

CCCTCCTCGA CGCCATCGAC GCGATCGAGC CCCCTTCGCG CCCTACCGAC 600 

AAGCCCCTCC GTCTTCCCCT CCAGGTTCGT TTCCCTGCTC GCGGTTTACG 650 

45 CTGCTACTTC GAGCTGACCC GCGAGCTCTG CCCGAACAGG ATGTCTACAA 700 

GATCGGTGGT ATCGGCACAG TCCCCGTCGG CCGTGTCGAG ACCGGCACGA 750 

TCAAGGCCGG TATGGTCGTC GTCTTCGCCC CGGCCAACGT CACCACTGAG 800 

GTCAAGTCGG TCGAGATGCA CCACGAGCAG CTCGAGGCTG GTCTCCCGGG 850 

AGACAACGTC GGCTTCAAGT GCGTCTCATC ATGTTTTTGC TTCGCTCGGC 900 

50 CATTTTTTCA GTCCTGACCC CGTTTTGCCC CTCGACAGCG TCAAGAACGT 950 

TTCCGTTAAG GACATCCGTC GCGGTAACGT CTGCGGTGAC TCGAAGAACG 100 0 

ACCCCCCCAA GGAGGCCGCT TCCTTCAAGG CCCAGGTCAT CGTCATGAAC 10 50 

CACCCCGGTC AGATCGGCAA CGGTTACGCT CCCGTTCTCG ACTGCCACAC 1100 

CGCCCACATT GCCTGCAAGT TCGACACCCT CCTCGAGAAG ATCGACCGTC 1150 

55 GCTCGGGCAA GTCGATTGAG GACCTCCCCA AGTTCGTCAA GTCGGGTGAC 12 00 

GCCGCCATCG TCAAGATGGT TCCCTCCAAG CCGATGTGTG TCGAGTCGTT 12 50 

CGCCGAGTAC CCCCCTCTCG GACGTTTCGC CGTCCGTGA 12 89 

60 
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2) INFORMATION FOR SEQ ID NO: 453 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1070 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: WSA-148 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453 

GTGAGCGCGG TATCACCATC GATATTG CTC TGTGGAAGTT CGAGACCCCC 50 

AAGTACTACG TCACCGTCAT TGACGCCCCC GGTCATCGCG ATTTCATCAA 100 

GAACATGATC ACTGGTACCT CGCAGGCCGA CTGCGCCATT CTCATCATTG 150 

CCGCTGGTAC TGGTGAGTTC GAGGCTGGTA TCTCCAAGGA TGGCCAGACT 2 00 

CGTGAGCACG CTCTGCTCGC CTACACCCTG GGTGTGCGGC AGCTGATCGT 2 50 

CGCCATCAAC AAGATGGACA CGGCCAAGTG GGCTGAGGCT CGTTACCAGG 3 00 

AGATCATCAA GGAGACCTCC AACTTCATCA AGAAGGTCGG CTACAACCCC 3 50 

AAGACTGTTG CCTTCGTCCC CATCTCGGGC TTCCACGGCG ACAACATGCT 4 00 

TACTCCCTCG ACCAACTGCC CCTGGTACAA GGGC TGGGAG AAGGAGGGCA 4 50 

AGAGCGGCAA GGTTACCGGT AAGACTCTGC TGGACGCCAT TGACGCCGTC 500 

GAGCCCCCCA AGCGCCCCAC GGACAAGCCC CTGCGTCTGC CCCTCCAGGA 550 

TGTCTACAAG ATCGGCGGTA TCGGCACTGT . CCCTGTCGGC CGTATCGAGA 6 00 

CTGGTGTCCT GAAGCCCGGC ATGGTCGTCA CCTTTGCCCC GTCCAACGTC 65 0 

ACCACTGAAG TCAAGTCCGT CGAGATGCAC CACGAGCAGC TTGTTGAGGG 70 0 

TGTTCCCGGC GACAACGTCG GCTTCAACGT CAAGAACGTC TCCGTCAAGG 750 

AGATCCGTCG TGGCAACGTT GCCGGTGACT CCAAGAACGA CCCCCCCTCG 800 

GGCGCCGCCA CCTTCAACGC CCAGGTCATT GTCCTGAACC ACCCCGGCCA 850 

GGTCGG CAAC GGCTACGCCC CGGTTCTGGA CTGCCACACC GCCCACATTG 9 00 

CCTGCAAGTT CACCGAGATC CTTGAGAAGA TCGACCGCCG TACCGGCAAG 950 

TCGGTTGAGA ACAACCCCAA GTTCATCAAG TCGGGTGACG CCGCCATTGT 1000 

CAAGCTGACG CCCTCGAAGC CCATGTGCGT TGAGGCCTTC ACTGACTACC 1050 

CCCCTCTGGG CCGTTTCGCC 107 0 



2) INFORMATION FOR SEQ ID NO: 454 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 92 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stephanoascus ciferrii 

(B) STRAIN: ATCC 52550 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 454 

CTTAAGTCTG AGCGTGAGAG AGGTATCACC ATCGATATTG CTCTCTGGAA 50 

ATTCGAGACT CCTAAGTACA ACGTTACCGT CATTGATGCT CCAGGTCACA 100 

GAGATTTCAT CAAGAACATG ATTACTGGTA CCTCCCAGGC CGATCTTGCC 150 
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ATCCTTATCA TTGCTGGTGG TGTCGGTGAG TTCGAGGCTG GTATCTCCAA ' 2 5o - 

GGACGGTCAG AC CAGAGAGC ACGCTCTTCT TTCTTTCACC CTTGGTGTCA 2 50 

GAAACATGAT TGTTGCTGTC AACAAGATGG ACTCCGTCAA GTGGTCTGAG 3 00 

GATCGTTTCA ACGAAATTGT CAAGGAGACC TCCAACTTCG TCAAGAAGGT 350 

5 TGGTTACAAC CCTAAGAATA TTGCTTTCGT TCCTATCTCC GGTTGGAACG 400 

GTGACAATAT GATTGAGCCA TCCACCAACT GCCCATGGTA CAAGGGTTGG 4 50 

GAGCGTGAGA CCAAGAACGG TACTGCCAAG GGTAAGACCA TCTTGGAGGC 500 

CATTGACTCT ATGGAGCCAC CTTCCAGACC AGTTGACAAG CCTCTCCGTC 550 

TTCCTCTTCA GGACGTTTAC AAGATTGGTG GTATTGGTAC GGTGCCAGTT 600 

10 GGTCGTGTTG AGACTGGTGT TATCAAGCCA GGTATGGTTG TTACCTTTGC 650 

CCCAGCTGGT GTCACCACTG AAGTCAAGTC TGTCGAGATG CACCACGAAC 700 

AGATCCCAGA AGGTACCCCA GGTGACAACG TTGGTTTCAA CGTCAAGAAC 750 

GTCTCCGTCA AGGAAATCAG ACGTGGTAAC GTTACTGGTG ACTCCAAGAA 8 00 

CGACCCACCA AAGGGCTGCG ACTCTTTCAA CGCTCAGGTC ATCATCTTGA 8 50 

15 ACCACCCTGG TCAGATCTCT GCTGGTTACG CTCCAGTTTT GGACTGCCAC 900 

ACTGCTCACA TTGCTTGCAA GTTTGAGGAG CTCATTGAGA AGATTGACAG 950 

ACGTTCCGGT AAGAAGGTCG AAGACTCTCC TAAGTTCGTC AAGGCCGGTG 10 00 

ATGCCGCCAT TGTCAAGATG GTTCCATCCA AGCCAATGTG TGTTGAAACC 1050 

TTCACTGAGT ACCCACCTCT TGGTCGTTTC GCCGTCCGTG AC 10 92 

20 



2) INFORMATION FOR SEQ ID NO: 455 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 9 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNES S : Doubl e 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 
35 (B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455 

GCCGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCCCTCT GGAAGTTCGA 50 

4 0 GACCCCCAAG TACAATGTCA CCGTCATTGG TATGTTTTCT CTTTACCTTT 100 

CCCCTCCATC GTCTTGCTGT GCCATAACTA ACGAGAGTAG ACGCCCCCGG 150 

TCACCGTGAC TTCATCAAGA ACATGATCAC TGGTACCTCC CAGGCCGACT 2 00 

GTGCTATTCT CATCATTGCT GCCGGTACTG GTGAGTTCGA GGCTGGTATC 250 

TCCAAGGATG GCCAGACCCG TGAGCACGCT CTGCTCGCCT TCACCCTTGG 3 00 

4 5 TGTCAAGCAG CTCATCGTTG CCATCAACAA GATGGACACC ACCAACTGGT 3 50 

CCGAGGACCG TTTCAAGGAA ATCATCAAGG AAGTCACCAA CTTCATCAAG 4 00 

AAGGTTGGCT ACGACCCCAA GGGTGTTCCA TTCGTTCCAA TCTCTGGTTT 450 

CAACGGTGAC AACATGATTG AGGCCTCCAC CAACTGCCCA TGGTACAAGG 500 

GATGGAACAA GGAGACTAAG GCCGGTGGTG CCAAGACTGG CAAGACCCTC 550 

50 CTCGAGGCCA TCGATGCCAT CGACATGCCA ACCCGTCCTA CCGACAAGCC 600 

CCTCCGTCTC CCACTCCAGG ATGTCTACAA GATCTCTGGT ATCGGAACTG 650 

TACCAGTCGG TCGTGTTGAG ACCGGTATCA TCAAGCCCGG TATGGTCGTC 700 

ACCTTCGCCC CTGCCAACGT CACCACTGAA GTCAAGTCCG TCGAAATGCA 750 

CCACCAGCAG CTTCAGCAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 800 

55 TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGTAACGT TGCCGGTGAC 850 

TCCAAGAACG ACCCACCATC CGGCTGTGCC TCCTTCAACG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGCC AGATCGGTGC TGGTTACGCC CCAGTCCTCG 950 

ACTGCCACAC TGCTCACATT GCTTGCAAGT TCGCTGAGCT CCTCGAGAAG 1000 

ATTGACCGCC GTACCGGTAA ATCCGTCGAA GCCAACCCCA AGTTCGTCAA 1050 

60 GTCTGGTGAT GCCGCTATCG CCAAGATGGT TCCCTCCAAG CCTATGTGCG 1100 

277 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

TTGAGGCTTT CACTGACTAC CCCCCACTTG GTCGTTTCGC CGTCCGTGA' tl%9 



5 2) INFORMATION FOR SEQ ID NO: 456 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1101 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon cutaneum 

(B) STRAIN: ATCC 62 965 



20 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456 



TCTTGACAAG CTTAAAGCTG AACGTGAACG TGGTATCACC ATTGATATCG 50 

CTCTCTGGAA GTTCGAAACT CCTAAGTACT ACGTTACTGT TATTGATGCT 100 

CCAGGTCACC GTGATTTCAT CAAGAACATG ATTACTGGTA CTTCCCAAGC 150 

CGACTGCGCC ATTCTTATCA TTGCTGCCGG TGTCGGTGAA TTCGAAGCTG 2 00 

2 5 GTATCTCCAA GGAAGGTCAA ACCAGAGAAC ACGCTCTTCT CGCTTTCACC 250 

CTTGGTGTCA GACAACTTAT CATTGCCATC AACAAGATGG ACTCTGTCAA 3 00 

GTGGGACCAA AAGAGATACG AAGAAATCGT CAAGGAGGCT TCCAACTTCG 350 

TCAAGAAGGT TGGTTACAAC CCCAAGTCTG TTCCATTCGT TCCTATCTCT 4 00 

GGTTGGAACG GTGACAACAT GTTGGAACCT ACCACCAACG CCCCATGGTA 4 50 

3 0 CAAGGGATGG ACCAAGGAAA CCAAGGCTGG TGCCACTAAG GGTATGACTC 500 

TTATTGAAGC CATTGACGCC ATTGAACCAC CAGTAAGACC ATCCGACAAG 550 

CCACTCCGTC TCCCACTCCA AGATGTTTAC AAGATTGGTG GTATCGGAAC 600 

TGTGCCAGTC GGCCGTGTCG AAACCGGTAT CATCAAGGCC GGTATGGTCG 650 

TCACCTTTGC TCCACCAATG GTCACAACTG AAGTTAAGTC CGTTGAAATG 700 

35 CACCACGAAC AACTTGCTCA AGGTAACCCA GGTGACAACG TTGGTTTCAA 750 

CGTCAAGAAC GTTTCCGTTA AGGAAATCAG ACGTGGTAAC GTCTGTGGTG 800 

ACTCCAAGAA CGATCCACCA AAGGGCTGCG AATCTTTCAA CGCTCAAGTT 850 

ATCGTCTTGA ACCACCCTGG TCAAATCTCT GCTGGTTACT CTCCAGTTCT 900 

CGATTGCCAC ACTGCCCACA TTGCCTGCAG ATTCGACGAA CTCCTTGAAA 950 

40 AGATCGACCG TCGTTCCGGT AAGAAGATTG AAGACTCTCC AAAGTTTGTC 10 00 

AAGTCTGGTG ATGCCGCTAT CGTCAAGATG ATCCCAACCA AGCCAATGTG 1050 

CGTTGAAACC TTCACTGAAT ACCCACCACT TGGTCGTTTC GCCGTCCGTG 1100 

A 1101 



2) INFORMATION FOR SEQ ID NO: 457 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1085 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wangiella dermatitidis 

(B) STRAIN: WSA-229 

60 

278 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457 



PCT/CA00/01150 



GTTGAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC TATGTCACCG TCATCGACGC CCCGGGTCAT 100 

5 CGTGACTTTA TCAAGAACAT GATCACTGGT ACCTCGCAGG CCGACTGCGC 150 

CATCTTGATC ATTGCCGCCG GTACCGGTGA ATTCGAAGCC GGTATCTCCA 2 00 

AGGATGGTCA GACCCGTGAG CACGCTCTGC TCGCCTACAC CTTGGGTGTC 2 50 

AAGCAGCTCA TCGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCCGA 3 00 

GGAGCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 3 50 

10 TCGGCTACAA CCCCAAGGCC GTTCCTTTCG TCCCCATCTC CGGCTTCAAC 4 00 

GGTGACAACA TGATTGAGGT CTCCACCAAC TGCCCGTGGT ACAAGGGATG 4 50 

GGAGAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTCGAGG 500 

CCATTGACGC CATCGACCCA CCCACCCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCTCTCC AGGATGTCTA CAAGATCTCT GGTATCGGAA CGGTTCCTGT 600 

15 CGGTCGTGTC GAGACCGGTA CCATCAAGGC CGGTATGGTC GTCACCTTCG 650 

CTCCGGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAG 700 

CAGCTCGCCG AGGGTCTGCC AGGTGACAAC GTTGGCTTCA ACGTCAAGAA 750 

CGTCTCCGTC AAGGAGGTTC GTCGTGGTAA CGTTGCCGGT GACTCCAAGA 800 

ACGACCCGCC CAAGGGTGCC GAGTCCTTCA ACGCCCAGGT CATTGTCCTC 850 

2 0 AACCACCCTG GTCAGATCGG TGCCGGCTAC GCTCCAGTCT TGGATTGCCA 900 

CACTGCCCAC ATTGCTTGCA AGTTCGCCGA GTTGCTCGAG AAGATCGACC 950 

GTCGTACCGG AAAGTCCATC GAGAACAACC CCAAGTTCAT CAAGTCTGGT 1000 

GATGCTGCCA TCGTCAAGAT GATTCCCAGC AAGCCCATGT GTGTCGAGGC 1050 

TTTCACCGAC TATCCTCCTC TGGGTCGTTT CGCTG 1085 

25 



2) INFORMATION FOR SEQ ID NO: 458 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 492 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
40 (B) STRAIN: DAL- 95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458 

TGTCTTCATC CGGAATTGAT TGTGAGTCGT TCCACATGCT CACCTAGTTT 50 

4 5 TCGCTCGATC TTTTCACTAA CGCAAACCAT GTAGAACAAC ATTGCCAAGG 100 

CCCACGGTGG TTACTCCGTC TTCACTGGTG TTGGTGAGCG TACTCGTGAG 150 

GGTAACGATC TGTACCACGA AATGCAGGAG ACTGGTGTCA TTCAGCTCGA 2 00 

GGGTGAATCC AAGGTCGCAC TGGTGTTCGG ACAGATGAAC GAGCCCCCCG 2 50 

GTGCCCGTGC CCGTGTCGCC CTTACCGGTC TGACCATTGC CGAGTACTTC 3 00 

50 CGTGACGAGG AGGGTCAGGA CGTGCTGCTC TTCATTGACA ACATTTTCCG 3 50 

TTTCACCCAG GCCGGTTCTG AGGTGTCTGC CCTTCTCGGT CGTATCCCCT 4 00 

CTGCCGTCGG TTACCAGCCC ACCCTGGCCG TCGACATGGG TGGTATGCAG 450 

GAGCGTATCA CCACCACCAA GAAGGGTTCT ATTACCTCCG TC 4 92 

55 

2) INFORMATION FOR SEQ ID NO: 459 

(i) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 1154 bases 

279 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capitatus 

(B) STRAIN: ATCC 10663 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459 

GTCCGTGGTC AAGAAGTTAT TGACACTGGT GCCCCAATTA CCATTCCTGT 50 

TGGTCGTGGT ACTCTTGGTA GAATTATCAA CGTCATTGGT GAACCAATTG 100 

15 ACGAACGTGG TCCTATCAAG GCTTCTAAGT ATGCTCCTAT CCATACTGAA 150 

CCACCAACCT TTGCTGAACA ATCTACTTCT GCTGAAGTTC TTGAAACTGG 2 00 

TATCAAGGTT GTCGATCTTC TTGCTCCTTA CGCCCGTGGT GGTAAGATTG 2 50 

GTCTTTTCGG TGGTGCTGGT GTCGGTAAGA CTGTCTTCAT TCAAGAACTT 3 00 

ATTAACAACA TTGCCAAGGC TCACGGTGGT TTCTCTGTCT TCACTGGTGT 3 50 

2 0 CGGTGAAAGA ACCCGTGAAG GTAACGATCT TTACCGTGAA ATGAAGGAAA 400 

CTGGTGTCAT CAACCTCGAA GGTGACTCCA AGGTCGCTCT CGTTTTCGGT 450 

CAAATGAACG AACCTCCAGG TGCCCGTGCC CGTGTCGCTT TGACTGGTCT 500 

TACCATTGCC GAATACTTCC GTGATGAAGA AGGACAAGAT GTCTTGCTTT 550 

TCGTTGACAA CATTTTCAGA TTCACCCAAG CCGGTTCTGA AGTCTCTGCT 600 

25 CTTTTGGGTC GTATTCCATC TGCCGTCGGT TACCAACCTA CCCTTGCTAC 650 

CGATATGGGT GCCCTCCAAG AACGTATTAC CACCACCCAA AAGGGTTCCG 700 

TCACATCTGT CCAAGCCGTC TATGTCCCAG CAGACGATTT GACTGATCCT 750 

GCCCCAGCCA CCACTTTCGC T C AC TTGG AC GCCACCACTG TCTTGTCTCG 8 00 

TTCCATTTCC GAATTGGGTA TCTACCCAGC TGTCGATCCT CTCGATTCCA 850 

3 0 AGTCTCGTCT TTTGGATCCT GAAGTTATTG GACACGAACA CTACGAAGTT 900 

GCCACTCAAG TTCAACAAAC CCTCCAAGCT TACAAGTCTC TCCAAGATAT 950 

CATTGCCATT TTGGGTATGG ATGAATTGTC TGAAGCTGAT AAGCTTACTG 10 00 

TCGAACGTGC CCGTAAGATC CAAAGATTCC TTTCCCAACC ATTCGCTGTT 1050 

GCCGAAGTTT TCACTGGTAT CGAAGGTCGT CTCGTTCCAT TGAAGGAAAC 1100 

3 5 CGTCAGATCT TTCAAGGAAA TCCTTGAAGG TAAGTACGAT CACCTTCCAG 1150 

AAGC 1154 



4 0 2) INFORMATION FOR SEQ ID NO : 460 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 95 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460 



CCAATTCGAC GAAGGAAACT TGCCAGCTAT TTTGAATGCT TTGACTTTGA 50 

AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCTCAACA TTTGGGTGAA 100 

AACACCGTCA GAGCTATTGC TATGGATGGT ACTGAAGGTT TAGTCAGAGG 150 

TACCGAAGTC AACGATACCG GTGCCCCAAT CTCCGTCCCA GTCGGTAGAG 2 00 

60 GTACTTTAGG TAGAATCATC AATGTTGTTG GTGAACCAAT TGATGACAGA 250 

280 



SUBSTITUTE SHEET (RULE 26) 
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GGTCCAATTG AATGTAAGGA AAAGAAACCA ATTCACGCTG AACCACCATC 3 00 

ATTCGTTGAA CAATCCACTG CTGCCGARAT TTTGGAAACC GGTATCAAGG 3 50 

TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAARAT TGGTTTATTC 4 00 

GGTGGTGCTG GTGTCGGTAA GACCGTCTTT ATCCAAGAAT TGATTAACAA 4 50 

5 CATTGCCAAA GCCCATGGTG GTTTCTCTGT CTTTACCGGT GTYGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAAGA AACCGGTGTC 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTCTTCG GTCAAATGAA 600 

CGAACCACCA GGTGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

CTGAATACTT CAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTCATTGAT 700 

10 AACATTTTCA GATTCACCCA AGCTGGTTCC GAAGTGTCTG CTTTGTTAGG 750 

TCGTATTCCA TCTGCTGTCG GTTATCAACC AACCTTAGCC ACTGATATGG 8 00 

GTCTTTTGCA AGAACGTATT ACCACCACCA AGAAAGGTTC CGTCACCTCT 850 

GTCCAAGCTG TCTATGTCCC AGCTGATGAT TTGACCGATC CTGCTCCAGC 900 

CACTACATTC GCCCATTTGG ATGCCACTAC TGTCTTGTCT AGAGGTATTT 950 

15 CTGAATTGGG TATCTACCCA GCTGTCGATC CATTGGATTC CAAATCCAGA 10 00 

TTATTGGACG CTTCTGTTGT TGGTCAAGAA CATTACGATG TCGCTACTGG 1050 

TGTTCAACAA ACTTTACAAG CTTACAAATC CTTACAAGAT ATCATTGCTA 1100 

TTTTGGGTAT GGATGAATTG TCTGAAGCTG ATAAATTGAC TGTCGAAAGA 1150 

GCCCGTAAGA TCCAAAGATT CTTGTCTCAA CCATTCGCTG TTGCTGAAGT 12 00 

20 TTTCACTGGT ATCCCAGGTA GATTAGTCAG ATTGCAAGAC ACTGTCAAAT 1250 

CATTCAAGGA TGTTTTGGAA GGTAAATACG ATAACTTGCC AGAAA 1295 



25 2) INFORMATION FOR SEQ ID NO: 461 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 77 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3949 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 461 



TAACTTGCCA GCTATTTTGA ATGCTTTGAC TTTGAAGAAC GGTGACCAAG 50 

ATTTAGTTTT GGAAGTTGCT CAACATTTGG GTGAAAACAC CGTCAGAGCT 100 

ATTGCTATGG ATGGTACTGA AGGTTTAGTC AGAGGTACTG AAGTCAACGA 150 

TACCGGTGCC CCAATCTCCG TTCCAGTCGG TAGAGGTACC TTAGGTAGAA 200 

45 TCATCAATGT TGTTGGTGAA CCAATTGATG ACAGAGGTCC AATTGAATGT 250 

AAGGAAAAGA AACCAATTCA TGCAGAACCA CCATCCTTCG TTGAGCAATC 3 00 

CACTGCTGCC GAAATTTTGG AAACCGGTAT CAAGGTTGTC GACTTATTGG 350 

CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG TGCTGGTGTC 4 00 

GGTAAGACCG TCTTTATCCA AGAATTGATT AACAACATTG CTAAAGCCCA 450 

50 TGGTGGTTTC TCCGTCTTTA CCGGTGTCGG TGAAAGAACC AGAGAAGGTA 500 

ACGATTTGTA CCGTGAAATG AAAGAAACCG GTGTCATCAA CTTAGAAGGT 550 

GACTCCAAGG TCGCTTTGGT CTTTGGACAA ATGAACGAAC CACCAGGTGC 600 

TAGAGCTAGA GTTGCTTTGA CTGGTTTGAC TATTGCTGAA TACTTCAGAG 650 

ATGAAGAAGG TCAAGATGTC TTGTTGTTCA TCGATAACAT TTTCAGATTC 700 

55 ACCCAAGCTG GTTCCGAAGT GTCTGCTTTG TTAGGT CGT A TTCCATCTGC 750 

CGTCGGTTAT CAACCAACCT TAGCTACTGA TATGGGTCTT TTGCAAGAAC 800 

GTATTACCAC CACCAAGAAA GGTTCCGTCA CCTCTGTCCA AGCTGTCTAT 850 

GTCCCAGCTG ATGATTTGAC CGATCCTGCT CCAGCCACCA CATTCGCCCA 900 

TTTGGATGCC ACTACTGTCT TGTCTAGAGG TATTTCTGAA TTGGGTATTT 950 

60 ACCCAGCTGT CGATCCATTG GATTCCAAAT CCAGATTATT GGACGCTGCC 1000 

281 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

GTTGTTGGTC AAGAACATTA TGATGTCGCT ACTGGTGTTC ^CAAACT^Sf 1050 

GCAAGCTTAC AAATCCTTAC AAGATATCAT TGCTATTTTG GGTATGGATG 110 0 

AATTGTCTGA AGCTGATAAA TTGACTGTCG AAAGAGCCCG TAAGATTCAA 1150 

AGATTCTTGT CTCAACCATT CGCCGTTGCT GAAGTTTTCA CTGGTATTCC 12 00 

AGGTAGATTA GTCAGATTGC AAGACACTGT CAAATCATTC AAGGATGTTT 1250 

TGGAAGGTAA ATACGATCAC TTGCCAG 1277 



10 2) INFORMATION FOR SEQ ID NO: 462 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 78 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 62894 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462 



GTAACTTGCC AGCTATTTTG AACGCTTTGA CCTTGAAGAA CGGTGAAAAC 5 0 

GACTTAGTTT TAGAAGTTGC CCAACATTTG GGTGAAAACA CCGTCAGAGC 100 
TATTGCTATG GATGGTACTG AAGGTTTAGT TAGAGGTACT CCAGTTACCG 150 

ATTCTGGTGC TCCAATTTCT GTCCCAGTCG GTAGAGGTAC TTTAGGTAGA 2 00 

3 0 ATCTTGAACG TTATTGGTGA ACCAATTGAT GAACAAGGTC CAGTTGATGC 2 50 

CAAGGAAACC AGACCAATTC ACCAAGACCC ACCAGCATTC GTTGATCAAT 3 00 

CCACCAAGGC TGAAGTTTTG GAAACTGGTA TCAAGGTTGT CGATTTATTA 3 50 

GCCCCTTACG CTAGAGGTGG TAAGATTGGT TTATTCGGTG GTGCCGGTGT 4 00 

CGGTAAGACC GTCTTTATCC AAGAATTGAT TAACAACATT GCCAAGGCCC 4 50 

35 ATGGTGGTTT CTCTGTTTTC ACTGGTGTCG GTGAAAGAAC CAGAGAAGGT 500 

AACGATTTAT ATAGAGAAAT GAAGGAAACT GGTGTCATTA ACTTGGAAGG 550 

TGACTCCAAG GTCGCCTTGG TTTTCGGTCA AATGAACGAA CCACCAGGTG 600 

CTAGAGCTAG AGTTGCTTTA ACCGGTTTAA CCATTGCCGA ATACTTCAGA 650 

GACGAAGAAG GTCAAGATGT GTTATTGTTC GTCGATAACA TTTTTAGATT 700 

40 CACCCAAGCC GGTTCCGAAG TGTCTGCTTT GTTAGGTCGT ATTCCATCGG 750 

CTGTCGGTTA TCAACCAACC TTAGCCACTG ATATGGGTCT TTTACAAGAA 8 00 

AGAATTACCA CCACCACCAA GGGTTCCGTT ACTTCTGTCC AAGCTGTCTA 850 

CGTCCCAGCC GATGATTTAA CCGATCCTGC TCCAGCTACC ACTTTCGCCC 900 

ACTTGGATGC TACCACTGTG TTGTCTCGTG GTATCTCTGA ATTGGGTATT 950 

4 5 TACCCAGCTG TCGATCCATT GGATTCCAAA TCCAGATTGT TAGATGCTGA 10 00 

TATCGTTGGT AAAGAACACT ACGAAGTTGC CACTGGTGTC CAACAAACCT 1050 

TACAAGCTTA CAAATCTTTA CAAGATATCA TTGCTATTTT AGGTATGGAT 1100 

GAATTGTCTG AAGCCGATAA ATTGACTGTC GAAAGAGCCA GAAAGATCCA 1150 

AAGATTCTTG TCTCAACCAT TCGCCGTTGC CGAAGTTTTC ACCGGTATCC 12 00 

50 CAGGTAGATT AGTTAGATTG CAAGACACTG TTAAATCTTT CAAGGAAGTC 12 50 

TTAGAAGGTA AATATGATCA CTTACCAG 12 78 



55 2) INFORMATION FOR SEQ ID NO: 463 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1154 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

282 



SUBSTITUTE SHEET (RULE 26) 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 

(B) STRAIN: ATCC 66032 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463 



TCTGGTCAGA GGCGAGAAGG TCGTCGACAC AGGTGCCCCA ATCTCCGTCC 50 

CTGTCGGCAG AGAGACCCTG GGCAGAATCA TCAACGTTAT CGGTGAACCT 100 

ATCGACGAGC GTGGCCCAAT CAACTCAAAG TTGAGAAAGC CTATCCACGC 150 

CGACCCTCCT TCCTTCGCAG AACAGTCCAC CGCCGCCGAA GTCCTGGAAA 200 

15 CAGGTATCAA GGTCGTCGAC TTGCTGGCCC CTTACGCCAG AGGTGGTAAG 250 

ATCGGTCTGT TCGGTGGTGC CGGTGTCGGT AAGACCGTGT TCATCCAAGA 3 00 

ACTGATCAAC AACATCGCAA AGGCTCACGG TGGTTTCTCC GTGTTCACAG 3 50 

GTGTCGGTGA AAGAACCAGA GAAGGTAACG ATTTGTACAG AGAAATGAAG 4 00 

GAAACCGGTG TCATCAACTT GGAAGGTGAC TCTAAGGTCG CCTTGGTCTT 4 50 

2 0 CGGCCAAATG AACGAACCAC CAGGAGCCAG AGCCAGAGTC GCCTTGACCG 500 

GTTTGACCAT CGCAGAATAC TTCAGAGATG AAGAAGGTCA AGATGTCCTG 550 

CTGTTCGTCG ACAACATTTT CAGATTCACC CAAGCCGGTT CAGAAGTCTC 600 

CGCTTTGCTA GGTCGTATCC CATCCGCCGT CGGTTATCAA CCAACCTTGG 650 

CCACCGATAT GGGTCTGTTG CAAGAAAGAA TTACCACCAC AAAGAAGGGT 700 

25 TCCGTCACTT CCGTCCAAGC CGTCTACGTG CCTGCAGATG ATTTAACAGA 750 

TCCTGCCCCT GCCACTACTT TCGCGCACTT GGACGCCACC ACCGTCTTGT 800 

CCAGAAGTAT CTCAGAATTG GGTATCTACC CAGCTGTCGA CCCATTGGAC 850 

TCCAAGTCTA GATTGCTAGA CGCTGCCGTT GTCGGTGAAG AGCATTACAA 900 

CGTCGCCACA AAGGTCCAAG AAACTTTACA AACTTACAAG TCTCTGCAAG 950 

30 ATATCATCGC CATTTTGGGT ATGGATGAAT TGTCCGAACA AGATAAGCTA 10 0 0 

ACTGTCGAAA GAGCAAGAAA GATCCAAAGA TTCTTGTCCC AACCATTCGC 1050 

TGTCGCTGAA GTTTTCACCG GTATCGAAGG TAAGCTGGTC AGATTGAAGG 1100 

ACACCATCTC CTCTTTCAAG GCTGTCTTGG AAGGTAAGTA CGATGATCTT 1150 

CCAG 1154 

35 



2) INFORMATION FOR SEQ ID NO: 464 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 
50 (B) STRAIN: ATCC 62 60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464 

CCACTACGAG GACGGTAACC TTCCTGCTAT TTTCAACGCC TTGACTCTTA 50 

55 AGAACGGTGA CCAAAACTTG GTTTTGGAAG TTGCCCAGCA TTTGGGTGAA 100 

AACACCGTCA GAACCATTGC TATGGATGGT ACTGAAGGTT TGGTTAGAGG 150 

TGCCAGCGTC ACTGACACTG GTGCTCCTAT CTCTGTGCCT GTTGGTCGTG 2 00 

GTACTTTGGG TCGTATCATC AACGTTATTG GTGAGCCAAT TGACGAGCGT 2 50 

GGACCAATCG AGTCCAAGCA AAAGAAGCCC ATTCACGCTG AACCACCATC 3 00 

60 GTTCGTCGAA CAATCCACTT CTGCCGAGGT TTTGGAAACC GGTATCAAGG 3 50 

283 
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TTGTCGACTT GTTGGCTCCA TACGCCAGAG GTGGTAAGAT TGGATTGTTCT -4 00 

GGTGGTGCCG GTGTCGGTAA GACTGTGTTT ATCCAAGAGT TGATTAACAA 4 50 

CATTGCCAAG GCTCACGGTG GTTTCTCCGT GTTCACCGGT GTCGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACTGGTGTC 550 

5 ATCAACTTGG AAGGTGAATC CAAGGTGGCC TTGGTGTTCG GTCAAATGAA 600 

CGAACCTCCA GGAGCTAGAG CCAGAGTTGC CCTTACCGGT TTGACCATCG 650 

CTGAATACTT CAGAGATGAG GAGGGTCAAG ATGTGTTGTT GTTCGTCGAC 700 

AACATTTTCA GATTCACTCA AGCTGGTTCT GAAGTGTCGG CTTTGTTGGG 750 

TCGTATTCCT TCGGCTGTCG GTTACCAACC TACTTTGGCC ACCGATATGG 8 00 

10 GTTTGTTGCA AGAGCGTATT ACCACCACCA AGAAGGGTTC CGTCACCTCT 850 

GTCCAAGCTG TCTATGTGCC AGCCGATGAT TTGACCGATC CTGCTCCTGC 90 0 

TACTACTTTT GCTCACTTGG ATGCTACCAC TGTGTTGTCT AGAGGTATCT 950 

CCGAGTTGGG TATTTACCCA GCTGTCGATC CTTTGGATTC CAAGTCGAGA 1000 

TTGTTGGATG CCTCTGTTGT CGGTGAGGAG CACTACTCGG TTGCTTCTAA 1050 

15 CGTTCAACAA ACCTTGCAAG CTTACAAGTC TTTGCAAGAT ATCATTGCCA 1100 

TTTTGGGTAT GGACGAATTG TCGGAAGCTG ACAAGTTGAC CGTCGAGAGA 1150 

GCCCGTAAGA TCGAGAGATT CTTGTCTCAA CCATTTGCTG TTGCCGAAGT 12 00 

TTTCACTGGT ATCAGTGGTA AGTTGGTCAG ATTGGAGGAC ACTATCAGAT 1250 

CTTTCAAGGA AGTCTTGGAA GGTAAGTACG ATCACTTGCC AGA 12 93 

20 

2) INFORMATION FOR SEQ ID NO: 465 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida haemulonii 
35 (B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465 

CACTTTGACG ATGGTAACTT . GCCAGCCATT TTCAACGCCT TGAAGTTGAA 50 

4 0 GAACGGTGAC CAGGACTTGG TCTTGGAGGT CGCCCAGCAC TTGGGTGAGA 10 0 

ACACCGTCAG AACCATTGCC ATGGACGGTA CCGATGGTTT GGTCAGAGGC 150 

GAGGCTGTCA CTGACACTGG TGCTCCAATC TCCGTGCCTG TTGGTCGTGA 2 00 

GACTTTGGGT CGTATCATCA ACGTTATTGG TGAGC CAATT GACGAGAGAG 250 

GACCAATCAA GTCCAAGAAG AGAAACCCAA TCCACACTGA CCCACCAACC 300 

4 5 TTCGTTGAGC AGTCTACTTC TGCTGAGGTT TTGGAGACTG GTATTAAGGT 350 

TGTCGACTTG TTGGCCCCTT ACGCCAGAGG TGGTAAGATT GGTTTGTTCG 4 00 

GTGGTGCCGG TGTCGGTAAG ACCGTCTTTA TCCAAGAGTT GATTAACAAC 450 

ATTGCCAAGG CCCACGGTGG TTTCTCTGTC TTTACCGGTG TCGGTGAGAG 500 

AACCAGAGAA GGTAACGATT TGTACCGTGA AATGCAGGAG ACCGGTGTCA 550 

50 TCAACTTCGA GGGTGACTCC AAGGTCGCCT TGGTCTTCGG TCAGATGAAC 600 

GAGCCACCAG GAGCTAGAGC TAGAGTTGCT TTGACCGGTT TGACCATTGC 650 

CGAATACTTC AGAGATGAAG AAGGTCAGGA TGTGTTGTTG TTCGTTGACA 700 

ACATTTTCAG ATTCACTCAG GCTGGTTCCG AGGTGTCCGC CTTGTTGGGT 750 

CGTATTCCAT CTGCTGTCGG TTACCAGCCA ACCTTGGCCA CCGATATGGG 80 0 

55 TACCTTGCAA GAAAGAATTA CCACCACCAA GAAGGGTTCC GTCACCTCTG 850 

TCCAGGCCGT TTACGTGCCA GCTGATGATT TGACCGATCC TGCCCCAGCT 900 

ACCACTTTCG CTCACTTGGA TGCTACCACT GTGTTGTCTA GAGGTATTTC 950 

CGAGTTGGGT ATCTACCCAG CTGTCGACCC ATTGGACTCC AAGTCTAGAT 1000 

TGTTGGATGC CTCTGTTGTC GGTAAGGAGC ACTACGATGT TGCTACCAAC 1050 

60 GTCCAGCAGA CCTTGCAGGC CTACAAGTCC TTGCAGGATA TCATTGCCAT 1100 

284 
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WO 01/23604 

TTTGGGTATG GATGAATTGT CCGAAGCCGA 

CTAGAAAGAT TCAGAGATTC TTGTCCCAGC 

TTCACTGGTA TCGAGGGTAA GTTGGTCAGA 

TTTCAAGGAG GTCTTGGATG GTAAGTACGA 



PCT/CA00/01150 

CAAGTTGACT GTCGAGAGAG 11BCT 

CATTCGCCGT CGCCGAGGTT 1200 

TTGGAGGAGA CCGTCAAGTC 1250 

CCACTTGCCA GAG 12 93 



2) INFORMATION FOR SEQ ID NO: 466 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1111 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida inconspicua 
20 (B) STRAIN: ATCC 16783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466 

AACGGTGGAT CTAAGTTAGT CTTAGAAGTT GCTCAACATT TGGGTGAAAA 50 

25 CACTGTCAGA ACCATTGCTA TGGATGGTAC TGAAGGTTTA GTTAGAGGTC 100 

AACCAGTTAA TGATACTGGT GCTCCAATCT CTGTCCCAGT CGGTAGAGGT 150 

ACTTTAGGTA GAATCTTAAA CGTTATTGGT GATCCAGTCG ATGAAAGAGG 2 00 

TCCAATTGAT TGTAAGGAAA GAAAACCAAT TCATCAAGAT CCTCCTGCTT 250 

TCGTTGAACA ATCTACTGAA GCTGAAGTTT TAGAAACCGG TATTAAGGTT 3 00 

3 0 GTTGATTTAT TAGCTCCTTA CGCTAGAGGT GGTAAGATTG GTTTATTCGG 3 50 

TGGTGCTGGT GTCGGTAAAA CCGTTTTCAT TCAAGAATTA ATTAACAATG 4 00 

TTGCAAAGGC TCATGGTGGT TTCTCAGTTT TCACTGGTGT CGGTGAAAGA 4 50 

ACCAGAGAAG GTAATGATTT ATACAGAGAA ATGAAGGAAA CTGGTGTTAT 500 

TAACTTAGAA GGTGAATCTA AGGTCGCCTT AGTTTTCGGT CAAATGAATG 550 

3 5 AACCACCAGG AGCAAGAGCA AGAGTTGCTT TAACCGGTTT AACTATTGCT 600 

GAATACTTCA GAGATGAAGA AGGTCAAGAT GTCTTGTTAT TCATTGATAA 650 

CATTTTCAGA TTTACTCAAG CAGGTTCTGA AGTTTCTGCA TTGTTAGGTA 700 

GAATTCCATC CGCTGTCGGT TATCAACCAA CTTTAGCAAC TGATATGGGT 750 

CTTTTACAAG AAAGAATTAC AACTACTAAG AAAGGTTCCG TTACTTCTGT 800 

4 0 CCAAGCAGTT TATGTCCCAG CAGATGATTT AACTGATCCT GCTCCAGCAA 850 

CTACTTTCGC CCACTTAGAT GCAACTACTG TCTTATCTAG AGGTATTTCC 900 

GAATTAGGTA TTTACCCAGC TGTCGATCCA TTAGATTCTA AATCTAGATT 950 

ATTAGATGTT GCTGTTGTTG GTCAAGAACA TTATGATGTT GCAACTCAAG 1000 

TTCAAGAAAC TTTACAAGCA TACAAATCTT TACAAGATAT TATTGCTATT 1050 

45 TTAGGTATGG ATGAATTATC TGAACAAGAT AAATTAACCG TTGAAAGAGC 1100 

AAGAAAGATT C 1111 



50 2) INFORMATION FOR SEQ ID NO: 467 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 83 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

285 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467 

5 

TCGAACAAGG TCAATTGCCA GAAATTTTGA ACGCTTTGGA GATTGAWACT 50 

CCTCAAGGTA AGTTGGTTTT GGAAGTTGCC CAACATTTGG GTGAAAACAC 100 

CGTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTGGTC CGTGGTGAGA 150 

AGGTTTTGGA CACTGGTGCT CCAATTTCCG TCCCAGTCGG TAGAGAAACT 2 00 

10 TTGGGTAGAA TCATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGCCC 2 50 

AATCAAGTCC AAGATGAGAA AGCCAATTCA CGCTGACCCT CCATCCTTTG 3 00 

TTGAACAATC CACTGCTGCT GAAGTTTTGG AAACCGGTAT CAAGGTTGTC 3 50 

GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 4 00 

TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAGTTGATT AACAACATTG 450 

15 CCAAGGCCCA TGGTGGTTTC TCCGTCTTCA CCGGTGTCGG TGAAAGAACC 500 

AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACCG GTGTCATCAA 55 0 

CTTGGAAGGT GACTCCAAGG TCGCCTTGGT CTTCGGTCAA ATGAACGAAC 600 

CACCTGGAGC TAGAGCCAGA GTTGCCTTGA CCGGTTTGAC TATCGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTG TTGTTGTTTA TCGACAACAT 700 

2 0 TTTCAGATTC ACACAAGCCG GTTC CGAAGT GTCCGCTTTG TTGGGTCGTA 750 

TTCCATCCGC TGTCGGTTAC CAACCTACTT TGGCCACCGA TATGGGTTTG 800 

TTGCAAGAAA GAATTACCAC TACCAAGAAG GGTTCCGTTA CCTCCGTCCA 850 

AGCTGTCTAC GTCCCTGCTG ATGATTTGAC TGATCCTGCT CCAGCTACYA 900 

CTTTCGCCCA TTTGGACGCC ACCACCGTGT TGTCCAGAGG TATCTCCGAA 950 

2 5 TTGGGTATCT ACCCAGCTGT CGATCCATTG GATTCCAAGT CTAGATTGTT 1000 

GGACGCTGCC GTTGTCGGTC AAGAACATTA CGACGTCGCT ACTCAAGTTC 1050 

AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGCCATTTTG 110 0 

GGTATGGATG AATTGTCTGA ACAAGACAAG TTGACTGTCG AAAGAGCCAG 1150 

AAAGATCCAA AGATTCTTGT CTCAACCATT CGCCGTCGCC GAAGTTTTCA 12 0 0 

3 0 CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CGCTTCCTTC 12 50 

AAGGCTGTTT TGGAAGGTAA GTACGATCAC TTG 12 83 



35 2) INFORMATION FOR SEQ ID NO: 468 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1287 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 

(B) STRAIN: ATCC 3413 5 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468 



CTTCGAACAA GGCCAATTAC CACAAATTTT AAACGCTTTA GTTATGGATA 5 0 

ATGGTGGTAA CAAGTTAGTT TTAGAAGTTG CTCAACATTT AGGTGAAAAC 10 0 

ACTGTCAGAA CCATTGCTAT GGATGGTACT GAAGGTTTAG TTAGAGGTCA 15 0 

AACCGTTAAC GATACCGGTG CTCCAATCTC TGTCCCAGTT GGTAGAGGTA 2 00 

55 CCTTAGGTAG AATCTTGAAC GTCATTGGTG ATCCAGTCGA TGAAAGAGGT 2 50 

CCAGTTGACT GTAAGGAAAG AAAGCCAATT CACGCTGATC CTCCAGCTTT 3 00 

CGTTGAACAA TCCACTGAAG CTGAAGTTTT GGAAACTGGT ATTAAGGTTG 350 

TCGATTTATT AGCACCTTAC GCAAGAGGTG GTAAGATTGG TTTATTCGGT 400 

GGTGCTGGTG TTGGTAAGAC CGTTTTTATC CAAGAATTGA TCAACAATGT 4 50 

60 YGCAAAGGCT CATGGTGGTT TCTCCGTTTT CACTGGTGTT GGTGAAAGAA 500 

286 
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CCAGAGAAGG TAACGATTTA TACAGAGAAA TGAAGGAAAC TGGTGTTATT 550 

AACTTGGAAG GTGAATCCAA GGTCGCCTTA GTTTTCGGTC AAATGAACGA 600 

ACCACCAGGA GCTAGAGCAA GAGTTGCTTT AACTGGTTTG ACCATTGCAG 650 

AATATTTCAG AGATGAAGAA GGTCAAGATG TCTTGTTATT CATTGATAAC 700 

5 ATTTTCAGAT TCACCCAAGC AGGTTCTGAA GTCTCTGCAT TATTAGGTAG 750 

AATTCCATCT GCTGTCGGTT ATCAACCAAC TTTAGCAACC GATATGGGTC 800 

TTTTACAAGA AAGAATTACC ACCACCAAGA AGGGTTCCGT TACTTCTGTC 850 

CAAGCTGTTT ATGTCCCAGC CGATGATTTA ACCGATCCTG CTCCAGCTAC 90 0 

TACTTTCGCC CACTTGGATG CAACCACTGT CTTGTCYAGA GGTATTTCCG 950 

10 AATTAGGTAT CTACCCAGCT GTCGATCCAT TAGATTCTAA GTCTAGATTA 1000 

TTAGATGTTG CAGTTGTTGG TCAAGAACAT TATGAAGTTG CAACTCAAGT 1050 

CCAAGAAACT TTACAAGCTT ACAAGTCTTT ACAAGATATT ATTGCTATTT 1100 

TGGGTATGGA TGAATTATCT GAACAAGATA AGTTAACYGT TGAAAGAGCA 1150 

AGAAAGATCC AAAGATTCTT ATCTCAACCA TTCTCTGTTG CAGAAGTTTT 12 00 

15 CACTGGTATT CCAGGTAAGT TAGTCAGATT AGAAGAAACC ATCAAGTCTT 1250 

TCAGGGATGT TCTTGCAGGT AAGTACGATC ACTTACC 12 87 



20 2) INFORMATION FOR SEQ ID NO: 469 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 83 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lambica 

(B) STRAIN: ATCC 24750 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469 



TCGAACAAGG TCAATTACCA CCAATTCTTA ACGCTTTGGT CATGGAAAAC 50 

GATGGTCAAA AGTTAGTTTT GGAAGTTGCT CAACATTTGG GTGAAAACAC 100 

CGTCAGAACC ATTGCTATGG ACGGTACCGA AGGTTTAGTT AGAGGTCAAC 150 

CTGTTAACGA CACTGGTGCT CCAATCTCTG TCCCAGTTGG TAGAGGTACT 2 00 

40 CTTGGTAGAA TCTTGAACGT CACTGGTGAC CCAGTYGATG AAAGAGGTCC 2 50 

TGTCGAMTGT AAGGAGAGAA GACCAATTCA CCAAGACCCA CCTGCTTTCG 3 00 

TTGACCAATC CACTGAAGCT GAAGTTTTGG AAACCGGTAT TAAGGTTGTC 3 50 

GATTTATTAG CACCTTACGC TAGAGGTGGT AAGATTGGTT TGTTCGGTGG 4 00 

TGCTGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAATGTTG 4 50 

4 5 CTAAGGCCCA CGGTGGTTTC TCCGTTTTCA CTGGTGTCGG TGAAAGAACC 500 

AGAGAAGGTA ACGATTTATA CAGAGAAATG AAGGAAACCG GTGTTATTAA 550 

CCTTGAAGGT GAATCTAAGG TCGCCCTCGT TTTCGGTCAA ATGAACGAAC 600 

CACCAGGAGC AAGAGCTAGA GTTGCTCTTA CTGGTTTGAC CATTGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGCTTTTCA TTGACAACAT 700 

50 TTTCAGATTC ACCCAAGCAG GTTCCGAAGT GTCTGCTTTG TTAGGTAGAA 750 

TTCCWTCTGC CGTTGGTTAC CAACCAACCT TAGCAACCGA TATGGGTTTG 800 

TTACAAGAAA GAATTACCAC CACCAAGAAG GGTTCCGTCA CCTCCGTCCA 850 

AGCTGTTTAC GTCCCAGCTG ATGATTTRAC TGACCCTGCA CCAGCAACCA 900 

CTTTCGCCCA CTTGGATGCT ACCACCGTCT TGTCTAGAGG TATTTCCGAA 950 

55 TTAGGTATTT ACCCAGCTGT CGATCCATTA GATTCTAAGT CTAGATTATT 1000 

GGATGTCGCT GTTGTCGGTC AAGAACATTA TACCGTTGCA ACYCAAGTCC 1050 

AAGAAACTTT ACAAGCTTAC AAGTCCTTAC AAGATATCAT TGCTATTTTG 1100 

GGTATGGACG AATTATCTGA ACAAGATAAG CTTACTGTTG AAAGAGCAAG 1150 

AAAGATCCAA AGATTCCTTT CCCAACCTTT CTCCGTCGCA GAAGTTTTCA 1200 

60 CTGGTATCCC AGGTAAGCTT GTCAGATTAG AAGAAACCAT TATTTCTTTC 1250 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

AGAGATGTCC TCGATGGTAA GTACGACCAC TTA 1283 



5 2) INFORMATION FOR SEQ ID NO: 470 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 114 0 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida lusitaniae 

(B) STRAIN: ATCC 66035 



20 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 47 0 



AGGTGCCTCT GTCACTGACA CTGGTTCTCC AATCTCTGTC CCTGTTGGTC 50 

GTGAAACCTT GGGTAGAATT ATCAACGTTG TTGGTGAGCC AATTGACGAG 100 

AGAGGCCCAA TCAACTCCAA GAAGAGAAAC CCAATT CAC A CTGAGCCACC 150 

ATCGTTTGTT GAACAATCCA CTTCTGCTGA AGTTTTGGAG ACTGGTATCA 2 00 

2 5 AGGTTGTCGA CTTGTTGGCC CCTTACGCCA GAGGTGGTAA GATTGGTTTG 2 50 

TTCGGTGGTG CCGGTGTCGG TAAGACCGTT TTCATCCAAG AGTTGATTAA 3 00 

CAACATTGCC AAGGCCCACG GTGGTTTCTC TGTTTTCACT GGTGTCGGTG 3 50 

AAAGAACCAG AGAAGGTAAC GATTTGTACC GTGAAATGCA AGAGAC CGGT 4 00 

GTCATCAACT TCGAGGGTGA CTCCAAGGTC GCCTTGGTCT TCGGTCAAAT 450 

3 0 GAACGAACCA CCAGGAGCCA GAGCTAGAGT TGCTTTGACC GGTTTGACTA 50 0 

TTGCCGAGTA CTTCAGAGAC GAAGAGGGCC AAGATGTCTT GTTGTTCGTT 550 

GACAACATTT TCAGATTCAC CCAGGCCGGT TCTGAAGTGT CTGCTTTGTT 600 

GGGTCGTATT CCATCCGCTG TCGGTTACCA ACCAACCTTG GCCACCGATA 650 

TGGGTGCTTT GCAAGAGAGA ATTACCACCA CCAAGAAGGG TTCCGTCACC 700 

35 TCTGTCCAAG CCGTTTATGT TCCAGCTGAT GACTTGACTG ACCCTGCTCC 750 

AGCCACCACC TTCGCCCACT TGGACGCCAC CACTGTGTTG TCCAGAGGTA 800 

TCTCTGAATT GGGTATCTAC CCAGCTGTCG ACCCATTGGA CTCCAAGTCT 850 

AGATTGTTGG ACGCTTCTAT TGTTGGTAAG GAGCACTACG AAGTTGCTTC 90 0 

TAACGTTCAA CAAACTTTGC AAGCTTACAA GTCTTTGCAA GATATCATTG 950 

40 CCATTTTGGG TATGGATGAA TTGTCTGAGG CTGACAAGTT GACCGTTGAG 100 0 

AGAGCCAGAA AGATCCAAAG ATTCTTGTCT CAACCATTCG CTGTTGCCGA 1050 

GGTTTTCACT GGTATCCCAG GTAGATTGGT CAGATTGGAG GACACTGTCA 1100 

GATCCTTCAA GGAAGTTTTG GACGGTAAGT ACGACCACTT 114 0 



2 ) INFORMATION FOR SEQ ID NO: 471 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 12 96 bases 

* (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

* 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, norvegensis 

(B) STRAIN: ATCC 22977 

60 

. 288 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471 



PCT/CAOO/01150 



TCAATTCGAA CAAGGTCAAT TACCACCAAT TTTAAACGCT TTAGTCATGG 50 

ACAACGGTGG AACTAAGTTA GTTTTAGAAG TTGCTCAACA TTTAGGTGAA 100 

5 AACACCGTCA GAACCATTGC TATGGATGGT ACTGAAGGTT TAGTTAGAGG 150 

TCAACCAGTT AATGATACTG GTGCTCCAAT CTCTGTCCCA GTCGGTAGAG 2 00 

GTACTTTAGG TAGAATCTTA AACGTTATTG GTGATCCAGT CGATGAAAGA 2 50 

GGTCCAATCG ATTGTAAGGA AAGAAAACCA ATTCATCAAG ATCCTCCTGC 3 00 

TTTCGTTGAA CAATCAACTG AAGCTGAAGT TTTAGAAACT GGTATTAAAG 3 50 

10 TTGTCGATTT ATTAGCTCCT TACGCTAGAG GTGGTAAGAT TGGTTTATTC 4 00 

GGTGGTGCAG GTGTTGGTAA AACCGTTTTT ATTCAAGAAT TAATTAACAA 4 50 

TGTTGCAAAA GCTCATGGTG GTTTCTCCGT TTTCACTGGT GTCGGTGAAA 500 

GAACTAGAGA AGGTAACGAT TTATACAGAG AAATGAAGGA AACTGGTGTT 550 

ATTAATTTAG AAGGTGATTC TAAGGTCGCA TTAGTTTTCG GTCAAATGAA 600 

15 CGAACCACCT GGAGCAAGAG CAAGAGTTGC TTTAACTGGT TTAACTATTG 650 

CTGAATATTT CAGAGATGAA GAAGGTCAAG ATGTCTTGTT ATTCATTGAT 700 

AACATTTTCA GATTTACTCA AGCAGGTTCT GAAGTTTCCG CTTTGTTAGG 750 

TAGAATTCCA TCCGCTGTCG GTTATCAACC AACTTTAGCA ACTGATATGG 8 00 

GTCTTTTACA AGAAAGAATT ACTACTACTA AGAAAGGTTC TGTTACTTCC 850 

2 0 GTTCAAGCAG TTTATGTCCC AGCAGATGAT TTAACTGATC CTGCTCCAGC 900 

AACTACTTTC GCCCACTTAG ATGCAACTAC TGTCTTATCT AGAGGTATTT 950 

CCGAATTAGG TATTTACCCA GCTGTCGATC CATTAGATTC TAAATCTAGA 1000 

TTATTAGATG TTGCAGTTGT TGGTCAAGAA CATTATGATG TTGCAACTCA 1050 

AGTTCAAGAA ACTTTACAAG CTTACAAATC TTTACAAGAT ATTATTGCTA 1100 

2 5 TTTTAGGTAT GGATGAATTA TCTGAACAAG ATAAATTAAC CGTTGAAAGA 1150 

GCTAGAAAAA TTCAAAGATT CTTATCTCAA CCTTTCTCTG TCGCAGAAGT 12 00 

TTTCACTGGT ATCCCAGGTA AGCTTGTTAG ATTAGAAGAA ACTATTTCTT 1250 

CATTCAGAGA TGTCTTAGCA GGTAAGTACG ATCACTTACC AGAAAA 12 96 



30 



45 



2) INFORMATION FOR SEQ ID NO: 4 72 



(i) SEQUENCE CHARACTERISTICS : 
3 5 (A) LENGTH: 12 97 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 

(B) STRAIN : ATCC 90018 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472 



ACACTTCGAA GAAGGAAACT TGCCAGCTAT TTTGAACGCT TTGACCTTGA 50 

AAAACGGTAA ACAAGACTTG GTCTTGGAAG TTGCCCAACA TTTGGGTGAA 100 

50 AACACCGTCA GAGCAATTGC TATGGATGGT ACTGAAGGTT TAGTTAGAGG 150 

TACTTCTGTC ACTGACACTG GTGCCCCAAT TTCTGTCCCA GTTGGTAGAG 2 00 

GTACTTTGGG TAGAATCATC AATGTTACTG GTGACCCAAT TGATGAAAGA 250 

GGTCCAATTG AATGTAAGAA GAGAAACCCA ATCCACGCTG AACCACCTTC 300 

ATTCATTGAA CAATCCACTG CTGCTGAAGT TTTGGAAACC GGTATTAAGG 3 50 

55 TTGTCGACTT GTTGGCTCCA TACGCTAGAG GTGGTAAGAT TGGTTTGTTC 4 00 

GGTGGTGCCG GTGTCGGTAA AACCGTGTTT ATCCAAGAAT TGATCAACAA 450 

CATTGCCAAG GCACATGGTG GTTTCTCGGT TTTCACTGGT GTTGGTGAAA 500 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACTGGTGTT 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCT TTGGTTTTCG GTCAAATGAA 600 

60 CGAACCACCT GGAGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

289 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

CTGAATACTT TAGAGATGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC * 700 

AACATTTTCA GATTCACCCA AGCTGGTTCA GAAGTGTCTG CTTTGTTGGG 75 0 

TCGTATTCCA TCCGCTGTCG GTTATCAACC AACCTTGGCC ACCGATATGG 800 

GTCTTTTGCA AGAACGTATT ACCACTACCA AAAAGGGTTC AGTTACTTCA 8 50 

5 GTCCAAGCCG TTTACGTGCC AGCTGATGAT TTAACCGATC CTGCTCCAGC 900 

TACCACTTTC GCTCACTTGG ATGCCACCAC TGTGTTGTCT AGAGGTATTT 950 

CGGAGTTGGG TATTTACCCA GCTGTCGATC CATTGGATTC CAAATCCAGA 10 0 0 

TTGTTGGATG CTGCCGTTGT TGGTCAAGAA CATTACGACG TTGCCACTGG 105 0 

TGTTCAACAA ACTTTACAAG CTTACAAGTC CTTGCAAGAT ATCATTGCTA 1100 

10 TCTTGGGTAT GGATGAATTG TCCGAACAAG ATAAATTGAC TGTCGAAAGA 1150 

GCTCGTAAGA TTCAAAGATT CTTGTCTCAA CCATTTGCTG TCGCTGAAGT 12 00 

TTTCACTGGT ATTCCAGGTA AATTGGTTAG ATTGTCTGAA ACTGTCAAGT 12 50 

CATTCAAGGA AGTCTTGGAA GGTAAGTACG ATAACTTGCC AGAAAAT 12 97 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 473 



(i) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 12 8 5 bases 

{B> TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida, rugosa 

(B) STRAIN: ATCC 962 75 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473 



CCAGTTTGGC AACGACCTCC CTGCCATTTT GAACGCCCTC ACTCTTCAGC 50 

GTGAAGACGG TAACAAGCTT GTTCTTGAGG TTGCCCAGCA TCTCGGTGAG 10 0 

35 AACACCGTCC GTACCATTGC TATGGACGGT ACCGAGGGTT TGGTGCGTGG 150 

CACTGGTGTC CACGACACCG GACACCCCAT CATGACTCCC GTCGGTGACG 2 00 

GTACCCTGGG ACGTATTCTT AACGTCACCG GTGACCCTGT AGACGAGCGT 2 50 

GGTCCCGTCA AGACTGACAA GTTCCGCCCC ATCCACGCCG AGGCCCCTGC 3 00 

CTTCGATGAG CAGGCTACCA GTGCCGAGGT TCTTGAGACC GGTATCAAGG 3 50 

4 0 TTGTCGACTT GCTCGCTCCT TACGCCAAGG GTGGTAAGAT TGGTTTGTTC 4 00 

GGTGGTGCCG GTGTCGGCAA GACCGTCTTC ATCCAGGAGC TGATTAACAA 4 50 

CATCGCCAAG GCCCACGGTG GTTACTCCGT GTTCACTGGT GTCGGTGAGC 50 0 

GTACTCGTGA GGGTAACGAT TTGTACAAGG AAATGATCGA GTCCGGTGTC 550 

ATCAACCTCG ATGGTGAGTC CAAGGTCGCC TTGGTGTTCG GTCAGATGAA 600 

45 CGAGCCCCCT GGAGCCCGTG CCCGTGTCGC CCTTACCGGT CTTACCATCG 6 50 

CTGAGTATTT CCGTGATGAG GAGGGTAAGG ATGTCTTGTT GTTCGTTGAC 700 

AACATTTTCC GCTTCACTCA GGCCGGTTCT GAGGTGTCCG CCTTGCTTGG 750 

TCGTATTCCT TCCGCTGTCG GTTACCAGCC TACCCTGGCC ACCGATATGG 8 00 

GTGCCCTTCA GGAGCGTATC ACCACCACCA AGAAGGGTTC CGTTACATCC 8 50 

50 GTCCAGGCCG TCTACGTCCC TGCCGATGAT TTGACTGATC CCGCCCCTGC 90 0 

CACCACCTTC GCCCATTTGG ATGCCACCAC TGTCTTGTCT CGTGCCATCT 950 

CTGAGTTGGG TATCTACCCC GCTGTCGACC CTCTCGACTC CAAGTCCCGT 10 00 

CTTCTTGACG CCGCTGTCGT TGGTCAGGAG CACTACGATA CTGCCACCTC 10 5 0 

CGTTCAGCAG ACTTTGCAGG CTTACAAGTC TTTGCAGGAT ATCATTGCCA 1100 

55 TTCTTGGTAT GGATGAGTTG TCCGAGTCTG ACAAGCTCAC CGTCGAGCGT 1150 

GCTCGTAAGA TCCAGCGTTT CCTCTCCCAG CCTTTCGCTG TTGCTGAGGT 12 00 

CTTCACTGGT ATTCAGGGCC GTCTTGTTCC TCTCAAGGAC ACTGTCCGCT 12 50 

CCTTCAAGGA GATTCTCGAA GGTAAGTACG ATGCT 12 85 



290 
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PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 474 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 83 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida sphaerica 

(B) STRAIN: ATCC 2 5 04 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 474 



TTGAACAAGG TCAATTGCCA GCTATTTTGA ACGCTTTGGA AATCGACACT 50 

CCAGAAGGAA AGTTGGTTTT GGAAGTCGCT CAACATTTGG GTGAAAACAC 100 

20 TGTCAGAACC ATTGCTATGG ATGGTACTGA AGGTTTAGTC CGTGGTGAAA 150 

ACGTTTCTGA CACTGGTGCT CCAATTTCCG TCCCAGTTGG TAGAGAAACC 2 00 

TTGGGTAGAA TTATCAACGT TATTGGTGAG CCAATTGACG AAAGAGGTCC 2 50 

AATCAACTCC AAGATGAGAA AGCCAATTCA TGCTGATCCT CCATTATTCG 3 00 

TTGAACAATC CACTGCTGCT GAAGTTTTGG AAACTGGTAT CAAGGTTGTC 3 50 

25 GACTTGTTGG CCCCATACGC CAGAGGTGGT AAGATTGGTT TGTTCGGTGG 4 00 

TGCCGGTGTC GGTAAGACCG TTTTCATCCA AGAATTGATT AACAACATTG 4 50 

CCAAGGCTCA TGGTGGTTTC TCTGTCTTCA CTGGTGTCGG TGAAAGAACC 500 

AGAGAAGGTA ACGATTTGTA CCGTGAAATG AAGGAAACTG GTGTTATCAA 550 

CTTGGAAGGT GATTCTAAGG TCGCGTTGGT TTTCGGTCAA ATGAACGAAC 600 

3 0 CTCCTGGAGC TAGAGCTAGA GTCGCCTTGA CTGGTTTGAC CATCGCTGAA 650 

TACTTCAGAG ATGAAGAAGG TCAAGATGTC TTGTTGTTTA TCGACAACAT 700 

TTTCAGATTC ACACAAGCCG GTTCCGAAGT GTCCGCTTTG TTGGGTCGTA 750 

TTCCATCCGC TGTCGGTTAT CAACCAACTT TGGCCACCGA TATGGGTTTG 800 

TTGCAAGAAA GAATTACTAC CACCAAGAAG GGTTCCGTCA CTTCTGTCCA 850 

35 AGCCGTTTAC GTGCCAGCTG ATGATTTGAC TGATCCTGCT CCAGCTACCA 900 

CTTTCGCGCA TTTGGATGCC ACCACTGTGT TGTCCAGAGG TATCTCTGAA 950 

TTGGGTATCT ACCCAGCTGT CGATCCTTTG GATTCCAAAT CTAGATTGTT 1000 

GGATGCTGCC GTCGTTGGTC AAGAACATTA CGATGTCGCT ACTCAAGTTC 1050 

AACAAACTTT GCAAGCTTAC AAGTCTTTGC AAGATATCAT TGCCATTTTG 1100 

4 0 GGTATGGATG AATTGTCCGA ACAAGATAAG TTGACCGTCG AAAGAGCTAG 1150 

AAAGATTCAA AGATTCTTGT CTCAACCTTT CGCTGTCGCT GAAGTCTTCA 12 00 

CTGGTATCCC AGGTAGATTG GTCAGATTAA AGGACACCAT CTCTTCTTTC 1250 

AAGGCTGTCT TGGACGGTAA GTACGATCAC TTG 12 83 

45 



2) INFORMATION FOR SEQ ID NO: 475 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 12 90 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropical is 

(B) STRAIN: ATCC 13803 

60 

291 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475 



PCT/CA00/01150 



CGACGAAGGT AACTTGCCAG CTATTTTGAA TGCTTTGACT TTGAAGAACG 50 

GTGACCAAGA CTTGGTTTTG GAAGTTGCTC AACATTTGGG TGAAAACACC 100 

5 GTCAGGGCTA TTGCTATGGA TGGTACCGAA GGTTTAGTCA GAGGTACTGC 150 

TGTCACTGAT ACCGGTGCTC CAATTTCTGT CCCAGTTGGT AGAGGTACCT 200 

TGGGTAGAAT TATCAACGTT GTTGGTGAAC CAATTGATGA CAGAGGTCCA 250 

ATTGAATGTA AGGAAAGAAA GCCAATTCAC GCTGAACCAC CTTCATTCGT 300 

TGAACAATCT ACTGCTGCCG AAATTTTGGA AACCGGTATT AAGGTTGTCG 350 

10 ATTTGTTGGC TCCTTACGCT AGAGGTGGTA AGATTGGTTT GTTCGGTGGT 4 00 

GCTGGTGTCG GTAAAACCGT CTTCATCCAA GAATTGATTA ACAACATTGC 450 

CAAAGCTCAT GGTGGTTTCT CTGTCTTTAC CGGTGTCGGT GAAAGAACCA 50 0 

GAGAAGGTAA CGATTTGTAC CGTGAAATGA AAGAAACTGG TGTCATCAAC 55 0 

TTGGAAGGTG ACTCCAAGGT CGCCTTGGTC TTCGGTCAAA TGAACGAACC 60 0 

15 ACCTGGTGCT AGAGCTAGAG TTGCTTTGAC TGGTTTGACC ATTGCTGAAT 650 

ACTTCAGAGA TGAAGAAGGT CAAGATGTCT TGTTGTTCAT TGACAACATT 700 

TTCAGATTCA CCCAAGCTGG TTCCGAAGTG TCTGCTTTGT TGGGTCGTAT 750 

TCCATCTGCT GTCGGTTATC AACCAACCTT GGCCACTGAT ATGGGTCTTT 800 

TGCAAGAACG TATTACCACC ACCAAGAAAG GTTCTGTTAC TTCTGTCCAA 85 0 

20 GCTGTTTATG TCCCAGCCGA TGATTTAACC GATCCAGCTC CAGCTACCAC 90 0 

CTTCGCTCAC TTGGATGCCA CTACTGTCTT GTCTAGAGGT ATTTCTGAAT 950 

TGGGTATTTA CCCAGCTGTC GATCCATTGG ATTCTAAATC CAGATTATTG 1000 

GATGCTACTG TTGTTGGTCA AGAACATTAT GATGTTGCCA CTGGTGTTCA 1050 

ACAAACTTTA CAAGCTTACA AGTCCTTGCA AGATATCATT GCTATTTTGG 1100 

2 5 GTATGGATGA ATTGTCCGAA GCCGATAAAT TGACTGTCGA AAGAGCTCGT 1150 

AAGATTCAAA GATTCTTGTC CCAACCATTC GCTGTTGCCG AAGTTTTCAC 12 00 

TGGTATCCCA GGTAGATTGG TTAGATTACA AGATACTGTT AAATCTTTCA 1250 

AAGATGTCTT GGAAGGTAAA TACGATCACT TGCCAGAAAA 12 90 



30 



45 



2) INFORMATION FOR SEQ ID NO: 476 



( i ) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 12 67 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candidal utilis 

(B) STRAIN: Csp 388 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476 



TTTGAACGCC CTTGAGATCA AGAACCACGA GTCTGGAAGA TTGGTCCTCG 50 

AGGTTGCCCA GCACTTGGGT GAGAACACTG TCAGAACTAT TGCCATGGAT 100 

50 GGTACCGAAG GTCTCGTCCG TGGTGAGTCT GTTGTTGACA CCGGTTCTCC 150 

AATTACTGTC CCAGTTGGTC GTGAGACCTT GGGTCGTATC ATCAACGTTA 2 00 

TTGGTGAGCC AATTGATGAG CGTGGTCCAA TCAACACCAA GCACAGAAAC 250 

CCAATTCACG CTGAGCCACC ATCCTTCGCT GAGCAGTCTA CTGCTGCTGA 3 00 

GGTTTTGGAG ACTGGTATCA AGGTTGTCGA CCTTCTCGCC CCATACGCCA 3 50 

55 GAGGTGGTAA GATTGGTCTC TTTGGTGGTG CCGGTGTCGG TAAGACCGTT 400 

TTCATCCAGG AGTTGATTAA CAACATTGCC AAGGCCCACG GTGGTTTCTC 4 50 

TGTTTTCACC GGTGTCGGTG AGAGAACCAG AGAGGGTAAC GATTTGTACC 500 

GTGAGATGAA GGAGACTGGT GTCATCAATC TCGAGGGTGA GTCCAAGGTT 550 

GCTCTCGTCT TCGGTCAGAT GAACGAGCCA CCAGGTGCCC GTGCCCGTGT 600 

60 CGCTTTGACC GGTTTGACCA TTGCTGAGTA CTTCAGAGAT GAGGAGGGTC 650 

292 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
AGGATGTCTT GTTGTTTATT GACAACATTT 
TCCGAGGTGT CTGCCTTGCT TGGTCGTATT 
GCCAACTTTG GCCACCGATA TGGGTTTGTT 
CCCAGAAGGG TTCCGTCACT TCTGTCCAGG 
GATTTGACTG ATCCTGCTCC AGCCACCACT 
CACTGTGTTG TCCCGTGGTA TCTCTGAGTT 
ACCCATTGGA CTCCAAGTCC AGATTGTTGG 
GAGCACTACA ACACCGCCAC CGATGTCCAG 
GTCTCTCCAG GATATCATTG CTATTTTGGG 
CTGACAAGTT GACTGTCGAG AGAGCCAGAA 
CAGCCATTCG CTGTCGCTGA GGTTTTCACC 
TAGACTTCAG GACACCATCA AGTCCTTCAG 
ACGACCACTT GCCAGAG 



PCT/CAOO/01150 

TCAGATTCAC CCAGGCCGGT" 700 

CCATCCGCTG TCGGTTACCA 750 

GCAGGAGAGA ATTACCACCA 8 00 

CCGTTTACGT CCCAGCTGAT 850 

TTCGCCCACT TGGACGCCAC 900 

GGGTATTTAC CCAGCTGTCG 950 

ACGCTGCCGT TGTTGGTGAC 1000 

CAGACCCTTC AGGCTTACAA 1050 

TATGGATGAG TTGTCTGAGG 1100 

AGATTCAGCG TTTCCTTTCC 1150 

GGTATCCCAG GTAGATTGGT 12 00 

AGAGGTTTTG GACGGTAAGT 12 50 

1267 



2) INFORMATION FOR SEQ ID NO: 477 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 96 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477 

CCAATTCGAC GAAGGTAGCT TGCCAGCTAT CTTGAACGCC TTGACCTTGA 50 

AGAACGGTGA CCAAGACTTG GTCTTGGAAG TTGCCCAACA CTTGGGTGAA 100 

AACACTGTCA GAGCTATTGC TATGGATGGT ACCGAAGGTT TGGTCAGAGG 150 

TACTGCTGTC ACTGACACCG GTGCTCCAAT CTCCGTCCCA GTCGGTAGAG 2 00 

GTACCTTGGG TAGAATCATC AACGTTGTTG GTGAACCAAT TGACGACAGA 250 

GGTCCAATTG AATGTAAGGA AAGAAAGCCA ATTCACGCTG AACCACCTTC 3 00 

TTTCGTTGAA CAATCCACTG CTGCCGAAAT TTTGGAAACC GGTATCAAGG 3 50 

TTGTCGACTT GTTGGCCCCA TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 4 00 

GGTGGTGCCG GTGTCGGTAA GACTGTCTTT ATCCAAGAGT TGATTAACAA 450 

CATTGCCAAG GCCCATGGTG GTTTCTCTGT CTTCACTGGT GTCGGTGAAA 50 0 

GAACCAGAGA AGGTAACGAT TTGTACCGTG AAATGAAGGA AACCGGTGTC 550 

ATCAACTTGG AAGGTGACTC CAAGGTCGCC TTGGTTTTCG GTCAAATGAA 600 

CGAACCACCT GGTGCTAGAG CTAGAGTTGC TTTGACTGGT TTGACCATTG 650 

CCGAATACTT CAGAGACGAA GAAGGTCAAG ATGTCTTGTT GTTTATTGAC 700 

AACATTTTCA GATTCACCCA AGCCGGTTCC GAAGTGTCTG CTTTGTTGGG 750 

TCGTATTCCA TCTGCCGTCG GTTATCAACC AACCTTGGCC ACCGATATGG 800 

GTCTTTTGCA AGAACGTATT ACCACCACCA AGAAGGGTTC CGTCACCTCT 850 

GTCCAAGCTG TCTATGTCCC AGCCGATGAT TTGACCGATC CTGCTCCAGC 900 

CACCACCTTC GCTCACTTGG ATGCCACTAC TGTCTTGTCT AGAGGTATTT 950 

CCGAATTGGG TATCTACCCA GCTGTCGATC CATTGGACTC CAAGTCCAGA 100 0 

TTGTTGGATG CTGCTGTTGT TGGTCAAGAA CATTACGATG TTGCCACTGG 1050 

TGTCCAACAA ACCTTGCAAG CTTACAAGTC TTTGCAAGAT ATCATTGCCA 1100 

TTTTGGGTAT GGATGAATTG TCTGAATCCG ATAAGTTGAC TGTCGAAAGA 1150 

GCTCGTAAGA TCCAAAGATT CTTGTCGCAA CCATTTGCTG TTGCCGAAGT 12 0 0 

TTTCACTGGT ATCCCAGGTA GATTAGTTAG ATTGCTGGAA ACCGTCCAAT 1250 

CTTTCAAAGA CGTCTTGGCT GGTAAATACG ATCACTTGCC AGAAAA 1296 
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2) INFORMATION FOR SEQ ID NO: 478 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 12 95 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 7351 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478 



TCAATTCGAG CAAGGCAACC TCCCTGCCAT CTTGAACGCT CTCACCTTGA 50 

AGAATGGTGA CAACGACTTG GTTTTGGAAG TTGCCCAGCA CTTGGGTGAG 100 

20 AACACCGTCA GAGCCATTGC CATGGATGGT ACCGAGGGTT TGGTTAGAGG 150 

TGCGTCCGTC AAGGACACTG GCGCCCCTAT CTCGGTCCCC GTTGGCCGCG 200 

GGACTTTGGG TCGTATCATC AACGTCACCG GTGACCCCAT TGACGAGAGA 250 

GGTCCCATCG AGCTGACCCA GAGAAACCCT ATCCACGCCG ACCCCCCCTC 300 

GTTCGTTGAG CAGTCCACCA ACGCTGAGGT TTTGGAGACT GGTATCAAGG 350 

2 5 TTGTCGATTT GTTGGCTCCC TACGCCAGAG GTGGTAAGAT TGGTTTGTTC 4 00 

GGTGGTGCCG GTGTCGGTAA GACCGTCTTC ATTCAGGAGT TGATCAACAA 4 50 

CATCGCCAAG GCCCACGGTG GGTTCTCGGT CTTCACTGGT GTCGGTGAGA 500 

GAACTAGAGA GGGTAACGAC TTGTACCGTG AGATGAAGGA GACCGGTGTC 550 

ATCAACTTGG AGGGTGACTC CAAGGTGGCC TTGGTGTTCG GTCAGATGAA 600 

30 CGAGCCCCCT GGAGCCAGAG CCAGAGTCGC CTTGACCGGG TTGACCATTG 650 

CCGAATACTT TAGAGACGAG GAGGGTCAGG ATGTGTTGTT GTTCGTCGAC 700 

AACATCTTCA GATTCACCCA AGCTGGTTCG GAGGTGTCGG CCTTGTTGGG 750 

TCGTATTCCC TCTGCCGTCG GTTACCAGCC CACCTTGGCA ACTGATATGG 8 00 

GATTGTTGCA GGAGCGTATC ACCACGACCA AGAAGGGTTC CGTCACCTCA 850 

35 GTGCAGGCCG TCTACGTCCC CGCTGATGAC TTGACTGACC CTGCTCCCGC 900 

CACCACCTTT GCCCACTTGG ACGCCACCAC CGTGTTGTCC AGAGGTATCT 950 

CTGAGTTGGG TATCTACCCC GCCGTCGACC CCTTGGACTC CAAATCGAGA 1000 

TTGTTGGACG CTGCCGTGGT CGGTCAGGAG CACTACGATG TTGCCTCGAA 105 0 

CGTCCAGCAG ACCTTGCAGG CCTACAAGTC CTTGCAGGAT ATCATTGCCA 1100 

40 TTTTGGGTAT GGATGAGTTG TCCGAGGCTG ACAAGTTGAC CGTTGAGAGA 1150 

GCCAGAAAGA TCCAGAGATT CTTGTCGCAG CCCTTCGCTG TTGCCGAGGT 12 00 

TTTCACTGGT ATCAAGGGTA GATTGGTCAG ATTGGAGGAC ACCGTCAGAT 12 50 

CTTTCAAGGA GGTTTTGGAG GGTAAGTACG ACCACTTGCC CGAGA 12 95 

45 

2) INFORMATION FOR SEQ ID NO: 479 

(i) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) " STRAIN: Silveira 

60 

294 
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WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 47 9 



PCT/CA00/01150 



TTCAGGAATT GATTGTAAGT TCTGTTATCA 
CTGATATGCT CTAGAACAAC ATTGC TAAGG 
5 TTCACTGGTG TCGGTGAGCG TACCCGTGAG 
AATGCAAGAG ACCCGTGTCA TTCAACTCGA 
TTGTCTTCGG TCAAATGAAC GAGCCCCCTG 
CTTACCGGTT TGACCATTGC TGAATACTTC 
CGGTAGGCTT CATGCTTCTA TCGCTAGGGG 
10 TCGCTTTTCT AGTGCTTCTC TTTATTGACA 
GCTGGTTCTG AAGTGTCTGC CTTGCTCGGT 
TTACCAACCT ACTCTCGCCG TCGACATGGG 
CCACCACCAC CAAGGGATCC ATTACTTCAG 

15 



ACTAAAGCCG ACAGCGGTTG 50 

CTCACGGTGG TTACTCCGTG 100 

GGTAACGATT TGTACCATGA 150 

CGGAGAGTCC AAGGTCGCTC 2 00 

GTGCCCGTGC CCGTGTTGCC 2 50 

CGTGACGAGG AAGGCCAAGA 3 00 

CGTGTGATAC AGGAGGCTAA 350 

ACATTTTCCG TTTCACTCAA 4 00 

CGTATTCCTT CCGCTGTCGG 4 50 

TGTTATGCAG GAACGTATCA 500 

TGCA 534 



2) INFORMATION FOR SEQ ID NO: 48 0 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 4 94 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus albidus 

(B) STRAIN: ATCC 66030 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 80 



CGTCTTGATT CAAGAATTGA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

. ACTCGGTCTT CACCGGTGTC GGTGAGCGAA CTCGAGAGGG TAACGATCTG 100 

3 5 TACCACGAAG TGAGTTGCGC CGTCCGAGTG TTTCCCGGGG AATCGCAAGA 150 

CTGATGTTGT CCCTTCTTCT CAGATGAGGG AAACCGGTGT CATCAACCTC 2 00 

GAGGGTGACT CCAAGGTCGC CTTGGTCTTC GGTCAGATGA ACGAGCCCCC 2 50 

TGGAGCCCGA GCCCGAGTCG CCTTGACCGG TTTGACCATT GCCGAATACT 3 00 

TCCGAGACGA GGAGGGTCAG GATGTCTTGT TGTTCATTGA CAACATTTTC 3 50 

4 0 CGATTCACCC AAGCCGGTTC CGAAGTGTCC GCCTTGTTGG GTCGTATCCC 400 

CTCCGCCGTC GGTTACCAGC CCACTCTGTC CACCGACATG GGTACCATGC 45 0 

AGGAGCGAAT TACCACCACC AAGAAGGGTT CCATCACTTC CGTC 4 94 



45 



55 



2) INFORMATION FOR SEQ ID NO : 481 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 415 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 81 

295 
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TTCAGGAGCT TAT CAACAAC 
TTCACTGGTG TCGGTGAGCG 
AATGCAGGAG ACTTCCGTCA 
5 TGGTCTTCGG TCAGATGAAC 
CTTACCGGTC TTACTGTAGC 
CGTGCTGCTC TTCATTGACA 
AGGTGTCTGC CCTTCTCGGT 
ACCCTCGCCG TCGACATGGG 
10 CAAGGGTTCC ATTAC 



ATCGCCAAGG CCCACGGTGG 
AACTCGTGAG GGTAACGATC 
TTCAGCTTGA TGGCGAGTCC 
GAGCCCCCTG GAGCTCGTGC 
TGAATACTTC AGAGATGAGG 
ACATTTTCCG ATTCACTCAG 
CGTATCCCCT CTGCCGTCGG 
TGGTATGCAA GAGCGTATTA 



TTACTCCGTC £o 

TGTACCACGA 100 

AAGGTCGCCC 150 

CCGTGTCGCC 2 00 

AGGGTCAGGA 2 50 

GCCGGTTCCG 3 00 

TTACCAGCCC 3 50 

CCACCACCAC 4 00 

415 



2) INFORMATION FOR SEQ ID NO: 482 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 81 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 5 (A) ORGANISM: Geotrichum spp. 

<B) STRAIN: Lev- 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482 

30 AGGACAACCT CCCCGCTATT CTTAACGCTC TTGAGCTTAA GAGAGATAAC 50 

GGTGAGAAGC TCGTTCTCGA GGTTGC C C AG CATCTGGGTG AGAACACTGT 100 

CCGTACTATT GCTATGGACG GTACTGAGGG TCTCGTCCGT GGTCAGCCCG 150 

TTGTTGACAC CGGTGCCCCC ATTACCATTC CCGTTGGTCG TGGTACTCTT 2 00 

GGTAGAATTA TCAACGTCAT TGGTGAGCCC ATCGATGAGC GTGGACCCAT 2 50 

35 TGAGGCTACC AAGTACCTCC CCATCCACAC CGAGCCCCCC ACCTTCGCTG 300 

AGCAGTCTAC CTCCGCTGAG GTTCTTGAGA CTGGTATCAA GGTTGTCGAT 350 

CTCCTTGCCC CCTACGCCCG TGGTGGTAAG ATTGGTCTCT TCGGTGGTGC 4 00 

CGGTGTCGGT AAGACCGTTT TCATTCAGGA GCTGATTAAC AACATTGCCA 450 

AGGCCCATGG TGGTTTCTCC GTTTTCACCG GTGTCGGTGA GAGAACCCGT 500 

4 0 GAGGGTAACG ATTTGTACCG TGAGATGAAG GAGACCGGTG TCATCAACCT 550 

CGAGGGTGAG TCTAAGGTCG CTCTCGTTTT CGGTCAGATG AACGAGCCCC 600 

CTGGAGCCCG TGCCCGTGTT GCTCTTACTG GTCTTACCAT TGCTGAGTAC 650 

TTCCGTGATG AGGAGGGTCA GGATGTGTTG CTCTTCGTTG ACAACATTTT 700 

CCGTTTCACT CAGGCCGGTT CCGAGGTGTC TGCCCTTTTG GGTCGTATTC 750 

45 CCTCCGCTGT CGGTTACCAG CCCACCCTTG CCACTGATAT GGGTGCCCTG 800 

CAAGAGCGTA TTACCACCAC CCAGAAGGGT TCCGTCACTT CCGTCCAGGC 8 50 

CGTCTACGTC CCTGCCGATG ATTTGACCGA TCCTGCCCCT GCTACCACCT 900 

TCGCCCATTT GGATGCCACC ACCGTCTTGT CGCGTTCCAT TTCTGAGTTG 950 

GGTATCTACC CCGCTGTCGA TCCCCTTGAT TCCAAGTCTC GTCTTTTGGA 1000 

50 TATCACCGTC GTTGGCCAGG AGCACTACGA TGTTGCTACC CAGGTCCAGC 1050 

AGACCCTCCA GTCCTACAAG TCTCTTCAGG ATATCATTGC CATTTTGGGT 1100 

ATGGATGAGT TGTCTGAGGC TGATAAGCTT ACTGTCGAGC GTGCC CGTAA 1150 

GATCCAGAGA TTCCTTTCCC AGCCCTTCAC TGTCGCTGAG GTTTTCACTG 12 00 

GTATCGAGGG CCGTCTCGTT CCTTTGAAGG ACACTGTTCG CTCTTTCAAG 12 50 

55 GAGATCCTTG AGGGCAAGTA CGACCACCTC C 1281 



2) INFORMATION FOR SEQ ID NO: 4 83 

60 

296 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 586 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Histoplasma capsulation 

(B) STRAIN: G185A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 83 

15 ATTCAAGAAT TGATCGTACG TTCCTCCGCC CCACACACGA TCAATGGAGA 50 

AAGAAACAAA TTTTTTGGGA GTGGTCATTT TTTCTAATAA TTCGAATAGA 100 

ACAACATTGC CAAAGCCCAC GGTGGTTACT CCGTGTTCAC TGGTGTCGGC 150 

GAGCGGACCC GTGAAGGAAA TGACTTGTAC CACGAAATGC AGGAAACCCG 2 00 

TGTTATCCAG CTCGATGGAG AGTCCAAGGT CGCCCTCGTT TTCGGTCAGA 2 50 

2 0 TGAACGAGCC TCCCGGAGCC CGTGCCCGTG TTGCCCTCAC TGGTCTGACC 3 00 

GTTGCCGAGT ACTTCCGTGA CGAGGAAGGC CAAGACGGTA TGTATAAGTA 350 

TACACCGTAG CAAATCAACA CAGAGCTTCA CTCACGCTCG GATTTAGTGC 4 00 

TTCTCTTCAT CGACAACATT TTCCGCTTCA CTCAGGCCGG TTCCGAAGTG 450 

TCTGCCCTGC TCGGCCGTAT TCCCTCCGCC GTCGGTTACC AACCCACCCT 500 

2 5 CGCCGTGGAC ATGGGTGGTA TGCAGGAACG TATCACCACT ACCACCAAGG 550 

GCTCCATCAC CTCTGTGCAR GCCGTCTACG TCCCCG 586 



30 2) INFORMATION FOR SEQ ID NO: 484 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1145 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

. (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia furfur 

(B) STRAIN: ATCC 42132 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484 



GCGTGGCCAG AAGGTCATTG ACACTGGTGC TCCCATCACC ATCCCCGTCG 50 

GTGGTGCCAC ACTGGGTCGT ATTCTGAACG TCACGGGTGA CCCTATTGAC 100 

GAGCGTGGCC CCGTTAAGAC TGACGTTTTC CGCCCCATTC ACCGTGACCC 150 

CCCTGCCTTT GTCGAGCAGT CGACTGATGC CGAGATTCTC GAGACTGGTA 2 00 

50 TCAAAGTCGT TGACCTGATT GCCCCTTACG CCCGTGGTGG TAAGATTGGT 2 50 

CTGTTCGGTG GTGCCGGTGT CGGTAAGACC GTGCTTATCC AGGAGCTCAT 3 00 

CAACAACATC GCCAAGGCCC ACGGTGGTTT CTCCGTGTTC ACTGGTGTCG 3 50 

GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT GATTGAAACC 4 00 

GGTGTCATTA ACCTCGAGGG TGACTCGAAG GTGGCTCTGG TGTTCGGTCA 450 

55 GATGAACGAG CCCCCGGGTG CCCGTGCGCG TGTCGCTCTT ACTGGTCTGA 500 

CTGTGGCCGA GTACTTCCGT GACGACGAGG GCCAGGATGT GCTGCTGTTC 550 

ATTGACAACA TTTTCCGTTT CACCCAGGCC GGTTCGGAGA CTTCGGCTCT 600 

GCTGGGTCGT ATCCCTTCGT CGGTCGGTTA CCAGCCCACT TTGTCGACCG 650 

ATATGGGTGC CATGCAGGAG CGTATCACCA CCACCAAGAA GGGTTCGATT 700 

60 ACGTCGGTGC AGGCCGTCTA CGTGCCCGCC GATGATGTCA CTGACCCTGC 750 

297 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 _ 

CCCTGCCACT ACCTTCGCCC ACCTTGACGC TACCACTGTG CTTGACCGTT 8 00 

CGATCGCTGA GCTGGGTATC TACCCCGCTG TTGACCCGTT GAACTCGAAC 850 

TCGCGTATGC TTGACCCCGC TATTGTGGGT CAGGAGCACT ACGACGTGGC 900 

CTCTGGCGTG CAGAAGCTGC TCCAGGACTA CAAGTCGCTT CAAGATATCA 950 

5 TTGCCATTCT GGGTATGGAT GAGCTTTCTG AGGAGGACAA GCTCACTGTC 1000 

GAGCGTGCCC GTAAGATGCA GCGTTTCATG TCGCAGCCTT TCGCTGTCGC 1050 

CCAGGTCTTT ACTGGTATCG AAGGTCGTCT TGTTGCCCTG AAGGACACGA 1100 

TCAAGGCCTG CAAGGAGATC CTGTCGGGCA AGCACGACAA CCTCC 1145 

10 

2) INFORMATION FOR SEQ ID NO: 485 

( i ) SEQUENCE CHARACTERISTICS : 
15 (A) LENGTH: 12 61 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malassezia pachydemtatis 

(B) STRAIN: ATCC 42756 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485 



TCCCGCCATC TTCAACGCCC TGGAGGTCCA GGACATGAAG AACGGTGGCC 50 

GCCTTGTTCT GGAGGTTGCC CAGCACCTTG GTGAGAACAC TGTTCGTTGC 100 

30 ATTGCTATGG ACGGTACCGA GGGTCTTGTC CGTGGTCAGA AGGTCCTTGA 150 

CACTGGTGCC CCGATCACTA TCCCTGTCGG TAACGGTACC TTGGGCCGYA 2 00 

TCCTGAACGT CACTGGTGAG CCTGTGGATG AGCGTGGTCC GGTTAAGACT 2 50 

GACGTCTACC GTCCAATCCA CCGTGAGCCC CCGGCGTTCG TTGAGCAGTC 3 00 

GACTGATGCT GAGATTCTTG AGACTGGTAT CAAGGTGGTC GACCTGCTCG 3 50 

3 5 CCCCATACGC TCGTGGTGGT AAGATTGGTC TWTTCGGTGG TGCYGGTGTY 4 00 

GGTAAGACCG TGCTGATTCA GGAGCTTATC AACAACATTG CCAAGGCCCA 4 50 

CGGTGGTTTC TCGGTGTTCA CTGGTGTCGG TGAGCGTACT CGTGAGGGTA 500 

ACGATCTGTA CCATGAAATG ATTGAGACTG GTGTCATCAA CGTTGACGGT 550 

GACTCGAAGG TCGCTCTCGT GTTCGGTCAG ATGAACGAGC CCCCGGGTGC 60 0 

4 0 CCGTGCCCGT GTCGCCCTGA CYGGTCTGAC CATCGCCGAG TACTTCCGTG 650 

ACGACGAGGG TCAGGATGTG CTGCTCTTCA TTGACAACAT TTTCCGTTTC 700 

ACTCAGGCTG GTTCGGAGAC TTCGGCTCTG CTGGGTCGTA TCCCGTCGGC 750 

TGTCGGTTAC CAGCCKACCC TTGCCACGGA TATGGGTGCC ATGCAGGARC 800 

GTATYACCAC CACCAAGAAG GGTTCGATTA CCTCGGTGCA GGCYGTTTAC 850 

45 GTTCCGGCCG ACGATGTGAC TGACCCTGCC CCGGCCACGA CCTTCGCCCA 900 

CTTGGACGCC ACCACGGTGT TGGACCGTTC GATTGCYGAG CTGGGTATCT 950 

ACCCGGCCGT CGACCCGCTG AACTCGAAGT CGCGTATGCT TGACCCGTCA 1000 

ATTGTCGGTG TGGAGCACTA CAACGTTGCT TCGGGTGTCC AGAAGCTTCT 1050 

YCAGGACTAC AAGTCGCTCC AAGATATCAT TGCCATTCTG GGTATGGATG 1100 

50 AGTTGTCGGA GGAGGACAAG CTCACTGTCG AGCGTGCCCG TAAGATGCAG 1150 

CGTTTCCTGT CGCAGCCTTT CGCTGTGGCC CAGGTCTTCA CTGGTATCGA 12 00 

GGGTCGTCTT GTGTCGCTCA AGGACACCAT CAACGCCTGC AAGGAGATTC 1250 

TGTCCGGTAA G 1261 

55 

2) INFORMATION FOR SEQ ID NO: 486 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1282 bases 

298 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Metschnikowia pulcherrima 

(B) STRAIN: DSM 70336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486 

AGGAGGGCAA CTTGCCAGCT ATCTTCAACG CTTTGACGTT GAAGAACGGC 50 

GAC CAGAAGT TGGTCTTGGA GGTGGCCCAG CACTTGGGTG AGAACACCGT 100 

CAGAACCATT GCCATGGACG GTACCGAGGG TTTGGTCAGA GGCGCCTCTG 150 

TCACCGACAC YGGTGCCCCT ATCTCCGTGC CTGTCGGCCG TGAGACCTTG 2 00 

GGTCGTATTA TCAATGTTGT TGGTGAGCCA ATCGACGAGA GAGGCCCAAT 2 50 

CAACACCAAG AAGAGAAACC CTATTCACAC CGACCCACCT TCGTTTGTCC 300 

AGCAATCCAC TTCCGCCGAG GTCTTGGAGA CTGGTATCAA GGTTGTCGAC 3 50 

TTGTTGGCCC CTTACGCCAG AGGTGGTAAG ATTGGTTTGT TCGGTGGTGC 4 00 

CGGTGT CGGT AAGACCGTGT TCATCCAGGA GTTGATTAAC AACATTGCCA 4 50 

AGGCCCACGG TGGTTTCTCC GTTTTCACCG GTGTCGGTGA GAGAACCAGA 500 

GAGGGTAACG ATTTGTACCG TGAAATGCAG GAGACTGGTG TCATCAACTT 550 

CGAGGGTGAC TCCAAGGTCG CCTTGGTTTT CGGTCAGATG AACGAGCCTC 60 0 

CAGGAGCTAG AGCYAGAGTT GCCTTGACCG GTTTGACCAT CGCCGAGTAC 650 

TTCAGAGACG AGGAGGGTCA GGATGTGTTG TTGTTCGTCG ACAACATTTT 7 00 

CAGATTCACC CAAGCCGGTT CTGAGGTGTC TGCCTTGTTG GGTCGTATTC 750 

CATCCGCTGT CGGATACCAG CCAACCTTGG CCACCGATAT GGGTGCCTTG 800 

CAGGAGAGAA TTACCACCAC CAAGAAGGGT TCCGTCACCT CCGTCCAGGC 850 

CGTCTACGTG CCAGCCGATG ACTTGACTGA CCCTGCGCCA GCCACCACTT 900 

TCGCCCACTT GGACGCCACC ACTGTGTTGT CCAGAGGTAT CTCTGAGTTG 950 

GGTATCTACC CCGCTGTCGA CCCCTTGGAC TCCAACTCCA GATTGTTGGA 1000 

CGCCACCGTT GTTGGCCAGG AGCACTACGA CGTCGCCACC AACGTCCAGC 1050 

AAACTTTACA AGCTTACAAG TCCTTGCAGG ATATCATTGC CATTTTGGGT 1100 

ATGGATGAGT TGTCCGAGAC CCGACAAGTT GACCCGTCGA GAGAGCCAGA 1150 

AAGATCCAGA AGTTCTTGTC CCAGCCATTT GCCGTCGCCG AGGTTTTCAC 12 00 

CGGTATTGAG GGTAGATTGG TTAGATTGGA GGACACCGTT AGATCCTTTA 12 50 

AGGAGGTTTT GGAAGGTAAG TACGACCACT TG 12 82 



2) INFORMATION FOR SEQ ID NO: 487 

( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 482 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: WSA-214 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 487 



TGTCTTTATC CAGGAGTTGA TTGTACGTCT TTACCTTTCT GCCTGACTGT 50 

TTACGACAAC TAACGAAAGC GTAGAACAAC ATTGCCAAGG CTCACGGTGG 100 

TTACTCTGTC TTCACTGGTG TCGGTGAACG TACTCGTGAG GGTAACGATT 150 
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TGTACCACGA AATGCAGGAA 
AAGGTCGCCC TCGTGTTCGG 
CCGTGTCGCT CTTACTGGTT 
AAGGTCAGGA CGTGCTTCTC 
5 GCCGGTTCTG AGGTGTCTGC 
TTACCAGCCC ACCCTTGCCG 
CCACCACCAC CAAGGGTTCC 



ACTGGTGTCA TTCAGCTCGA 
TCAGATGAAC GAGCCCCCCG 
TGACCATTGC CGAGTACTTC 
TTCATTGACA ACATTTTCCG 
CCTTCTGGGT CGTATCCCCT 
TCGACATGGG TATCATGCAG 
ATCACCTCCG TC 



PCT/CAOO/01150 

GGGTGAATCC 2 00 

GTGCCCGTGC 2 50 

CGTGACGAGG 3 00 

TTTCACTCAG 3 50 

CTGCCGTCGG 4 00 

GAGCGTATTA 4 50 

482 



10 



15 



20 



25 



2) INFORMATION FOR SEQ ID NO : 4 88 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 90 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 18205 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488 



TCGAACAAGG 
CCAAGTGGTA 
CACTGTCAGA 
3 0 AACAAGTTAC 
ACTTTAGGTC 
TCCAATTAAC 
TCAGTGAACA 
GTYGATTTAT 

3 5 TGGTGCCGGT 

TTGCTAAAGC 
ACCAGAGAAG 
TAACTTGGAA 
AACCACCAGG 

4 0 GAATACTTCA 

CATTTTCAGA 
GTATTCCATC 
TTGTTACAAG 
CCAAGCTGTT 

4 5 CCACTTTCGC 
GAATTAGGTA 
ATTAGATGCT 
TTCAACAAAC 
TTAGGTATGG 

50 AAGAAAAATC 
TCACTGGTAT 
TTCAAAGATG 



TAACTTACCA 
CTCCATTAGT 
ACTATTGCTA 
TGATACTGGT 
GTATTATCAA 
ACCAAACAAA 
ATCAACTGCT 
TAGCTCCATA 
GTCGGTAAAA 
TCATGGTGGT 
GTAACGATTT 
GGTGATTCTA 
TGCTAGAGCT 
GAGATGAAGA 
TTCACCCAAG 
TGCTGTCGGT 
AACGTATTAC 
TATGTCCCAG 
CCATTTGGAT 
TTTACCCAGC 
TCAGTTGTTG 
TTTACAAGCT 
ATGAATTGTC 
CAAAGATTCT 
YCCAGGTAGA 
TTTTGGAAGG 



GCTATCTTGA 
TTTAGAAGTT 
TGGATGGTAC 
TCTCCAATCA 
CGTTGTTGGT 
GAAACCCAAT 
GCTGAAGTTT 
CGCTAGAGGT 
CTGTCTTTAT 
TTCTCAGTTT 
ATACCGTGAA 
AGGTCGCTTT 
CGTGTTGCTT 
AGGTCAAGAT 
CCGGTTCAGA 
TATCAACCAA 
CACCACACAA 
CTGATGATTT 
GCTACTACTG 
TGTCGATCCA 
GTCAAGAACA 
TACAAATCTT 
TGAACAAGAT 
TATCTCAACC 
TTGGTTAGAT 
TAAATATGAT 



ATGCTTTAGA 
GCTCAACATT 
TGAAGGTTTA 
CTGTCCCAGT 
GAACCAATTG 
TCACGCTGAA 
TAGAAACTGG 
GGTAAAATTG 
CCAAGAATTG 
TCACCGGTGT 
ATGAAAGAAA 
AGTTTTCGGT 
TAACTGGTTT 
GTCTTGTTAT 
AGTTTCTGCC 
CTTTAGCAAC 
AAAGGTTCAG 
AACAGATCCT 
TCTTGTCTCG 
TTAGATTCTA 
TTATGATGTT 
TACAAGATAT 
AAATTGACTG 
ATTTGCTGTT 
TAAAAGACAC 
CACTTACCAG 



AATCAAAACC 50 

TAGGTGAAAA 100 

GTTCGTGGTG 150 

TGGTCGTGAA 2 00 

ATGAACGTGG 250 

CCACCTTCAT 3 00 

TATCAAAGTT 350 

GTTTATTCGG 4 00 

ATTAACAACA 450 

TGGTGAAAGA 500 

CTGGTGTTAT 550 

CAAATGAATG 6 00 

GACCATTGCT 650 

TCGTTGATAA 700 

TTATTAGGTC 750 

TGATATGGGT 800 

TTACTTCTGT 850 

GCTCCAGCTA 900 

TGGTATTTCA 950 

AATCAAGATT 1000 

GCTACCAACG 1050 

TATTGCTATT 1100 

TCGAAAGAGC 1150 

GCCGAAGTTT 12 00 

TATCAAATCA 1250 
1290 



55 

2) INFORMATION FOR SEQ ID NO: 489 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 12 91 bases 
60 (B) TYPE: Nucleic acid 

300 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 214 9 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 89 

CCAATTCGAA CAAGGTAACT TACCAGCTAT CTTGAATGCT TTAGAAATCA 50 

AAACCCCAAG TGGTACTCCA TTAGTTTTAG AAGTTGCTCA ACATTTAGGT 10 0 

GAAAACACTG TCAGAACTAT TGCTATGGAT GGTACTGAAG GTTTAGTCCG 150 

15 TGGTGAACAA GTTACTGATA CTGGTTCTCC AATCACTGTC CCAGTTGGTC 2 00 

GTGAAACTTT AGGTCGTATT ATCAACGTTG TTGGTGAACC AATTGATGAA 250 

CGTGGTCCAA TTAACACCAA ACAAAGAAAC CCAATTCACG CTGAACCACC 3 00 

TTCATTCAGT GAACAATCAA CTGCTGCTGA AGTTTTAGAA ACTGGAATCA 3 50 

AAGTTGTTGA TTTATTAGCT CCATACGCTA GAGGTGGTAA AATTGGTTTA 4 00 

2 0 TTCGGTGGTG CCGGTGTCGG TAAAACTGTC TTTATCCAAG AATTGATTAA 4 50 

CAACATTGCT AAAGCTCATG GTGGTTTCTC AGTTTTCACC GGTGTTGGTG 500 

AAAGAACCAG AGAAGGTAAC GATTTATACC GTGAAATGAA AGAAACTGGT 550 

GTTATTAACT TGGAAGGTGA TTCTAAGGTC GCTTTAGTTT TCGGTCAAAT 6 00 

GAATGAACCA CCAGGTGCTA GAGCTCGTGT TGCTTTAACT GGTTTGACCA 650 

2 5 TTGCTGAATA CTTCAGAGAT GAAGAAGGTC AAGATGTCTT GTTATTCGTT 700 

GATAACATTT TCAGATTCAC CCAAGCCGGT TCAGAAGTTT CTGCCTTATT 750 

AGGTCGTATT CCATCTGCTG TCGGTTATCA ACCAACTTTA GCAACTGATA 80 0 

TGGGTTTGTT ACAAGAACGT ATTACCACCA CACAAAAAGG TTCAGTTACT 850 

TCTGTCCAAG CTGTTTATGT CCCAGCTGAT GATTTAACAG ATCCTGCTCC 900 

3 0 AGCTACCACT TTCGCCCATT TGGATGCTAC TACTGTCTTG TCTCGTGGTA 950 

TTTCAGAATT AGGTATTTAC CCAGCTGTCG ATCCATTAGA TTCTAAATCA 1000 

AGATTATTAG ATGCTTCAGT TGTTGGTCAA GAACATTATG ATGTTGCTAC 1050 

CAACGTTCAA CAAACTTTAC AAGCTTACAA ATCTTTACAA GATATTATTG 110 0 

CTATTTTAGG TATGGATGAA TTGTCTGAAC AAGATAAATT GACTGTCGAA 1150 

3 5 AGAGCAAGAA AAATCCAAAG ATTCTTATCT CAACCATTTG CTGTTGCCGA 12 0 0 

AGTTTTCACT GGTATCCCAG GTAGATTGGT TAGATTAAAA GACACTATCA 12 50 

AATCATTCAA AGATGTTTTG GAAGGTAAAT ATGATCACTT A 12 91 



40 



50 



2) INFORMATION FOR SEQ ID NO: 490 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 508 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula minuta 

(B) STRAIN: ATCC 10658 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490 

CGTATTGATT CGTGAGTGGC CGTTCCCTTA CAGCAAGCTT ATAAAGGAGC 50 

GAAAAAGATC TGACATTCGG CTTATGTGCT ATACAGAGGA ACTCATCAAC 100 

AACGTCGCCA AGGCTCATGG TGGTTACTCT GTCTTCACCG GTGTCGGAGA 150 

60 GCGAACACGT GAAGGTAACG ATCTCTACCA CGAAATGATT GAAACCGGTG 2 00 

301 
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10 



20 



WO 01/23604 PCT/CA00/01150 

TCATTCAGCT CAAGAACGAC AAGTCCAAGG CCGCTCTGGT CTTCGGACXG 25V 

ATGAACGAGC CCCCCGGAGC TCGTGCCCGT GTCGCTCTGA CCGGTCTCAC 300 

CATCGCCGAG TACTTCCGTG ACGTCGAAGG ACAGGATGTG CTACTCTTCA 350 

TCGACAACAT TTTCCGATTC ACCCAGGCCG GTTCAGAGGT ATCTGCCCTG 4 00 

CTCGGACGTA TCCCATCTGC TGTCGGATAC CAGCCCACAC TCTCAACCGA 450 

TATGGGTGGT ATGCAAGAGC GAATCACAAC CACCAAGAAG GGTTCGATTA 500 

CCTCCGTC 508 



2) INFORMATION FOR SEQ ID NO: 491 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 686 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhodotorula mucilaginosa 

(B) STRAIN: ATCC 66034 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491 

TGTCCTCATT CAGGAGCTCA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCGGTCTT CACCGGTGTC GGCGAGCGTA CCCGTGAGGG TAACGGTGAG 100 

TCTCCCCCTT CAAACTTTTG GCCGGCTAGT TGGCGCAGCG CAAACTGACG 150 

30 CGCGCGCCCT GTCCAGACTT GTACCACGAG ATGATCGAGA CTGGTGTCAT 2 00 

CCAGCTCGAG AACGACAACT CGAAGTGCGC TCTCGTGTTC GGCCAGATGA 250 

ACGAGCCCCC TGGTGCCCGT GCCCGTGTCG CTCTCACTGG GTTCGTCCTT 3 00 

TCTCTCTCTC GAGCGTCCTG GCTTGATACG GAACGCTGAC ACGTCACGCA 350 

GTCTCACTAT TGCTGAGTAC TTCCGTGACG AGGAGGGCCA GGACGTGCTC 4 00 

35 CTCTTCATCG ACAACATCTT CCGTTTCACC CAGGGTGAGC CGCCTCCGCG 450 

GGCATTCTCC CGTTTCTTTC GCGCTGACGT CTGTCCCGTA TAGCCGGTTC 500 

GGAGGTGTCT GCCCTTCTCG GACGTATCCC GTCCGCTGTC GGATACCAGC 550 

CGACTCTCTC GACCGACATG GGTCAGATGC AGGAGCGTAT CACGTAAGTT 600 

TGGCCGCAGC TCCGTCCGCG GCGCCCTTTG TGTCTGACCG TGTTCCACCG 650 

4 0 CTCAGCACCA CCAAGAAGGG CTCGATCACC TGTGTC 686 



2) INFORMATION FOR SEQ ID NO: 4 92 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Sporobolomyces salmoni color 

(B) STRAIN: ATCC 32311 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492 

60 TGTCTTGATT CAGGAGCTCA TCAACAACGT CGCCAAGGCG CACGGTGGTT 50 

302 
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ACTCCGTTTT CACCGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGGTGAG 100 

CACACTTTCG CCGACCTGGC CATTCCGGAA CGTGCGGACT GACGAGGACC 15 0 

ACCTCGAACA GATCTCTACC ACGAGATGAT CGAGACCGGT GTCATTCAGC 2 00 

TCGACAACGA CAAGTCGAAG ACTGCTCTCG TCTTCGGCCA GATGAACGAG 2 50 

5 CCCCCTGGCG CCCGTGCCCG TGTCGCTCTT ACTGGTCTCA CCATCGCGGA 3 00 

GTACTTCCGT GACGACGAGG GCCAGGACGT GCTTCTCTTC ATCGACAACA 3 50 

TCTTCCGTTT CACCCAGGGT ACGTTCGATC CCGCCCGTCC AACACGAATG 4 00 

TCGTGGTGAC TGACAACCTG TTGCGCGTGC AGCCGGTTCG GAGGTGTCTG 4 50 

CCCTTCTCGG TCGTATCCCG TCCGCTGTCG GATACCAGCC CACTCTCTCG 500 

10 ACCGACATGG GTGGCATGCA GGAGCGTATC ACGTACGCCC TCTTCTGCTT 55 0 

TCTCTCGTTT CGCTCTGCAT CGCTCACGCA TGTTCGCCCG ACAGCACCAC 60 0 

CAAGAAGGGT TCGATCACCT GTGTC 62 5 



15 



30 



60 



2) INFORMATION FOR SEQ ID NO: 4 93 



( i ) SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 1211 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: WSA-14 8 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 93 



TCTCGTCCGT GGTGCTAAGG CCACTGACAC TGGTTCCCCC ATTACCATCC 50 

CCGTCGGCCC CGGTACCCTC GGTCGCATCA TGAACGTCAC CGGTGACCCG 10 0 

35 ATCGACGAGC GCGGTCCCAT CAAGACCGAC AAGTTCCGTC CCATCCACGC 150 

TGAGGCTCCC GAGTTCGTTG ACCAGTCGAC CACCGCTGAG GTTCTCGTGA 2 00 

CTGGTATCAA GGTCGTCGAT CTGCTTGCTC CCTACGCCCG TGGTGGTAAG 2 50 

ATTGGTCTGT TTGGCGGTGC CGGTGTTGGC AAGACCGTGT TCATCCAGGA 3 00 

GCTCATCAAC AACATCGCCA AGGCCCACGG TGGTTACTCC GTCTTCACCG 3 50 

40 GTGTCGGCGA GCGTACCCGT GAGGGTAACG ATCTGTACCA CGAAATGCAG 400 

GAGACCTCTG TCATTCAGCT TGACGGTGAC TCCAAGGTCG CCCTGGTGTT 4 50 

CGGTCAGATG AACGAGCCCC CTGGTGCTCG TGCCCGTGTC GCCTTGACCG 500 

GTTTGACTGT CGCTGAGTAC TTCCGTGACG AGGAGGGCCA GGATGGTATG 55 0 

TTTTGAATTA TTTCCTTGTC GTACAGTTCC AAATCGAAGA ATTACTAACT 600 

4 5 TGTCAGTGCT TCTCTTCATC GACAACATTT TCCGCTTCAC CCAGGCCGGT 650 

TCTGAGGTGT CTGCCCTTCT GGGTCGTATT CCCTCCGCTG TCGGTTACCA 700 

GCCCACGCTC GCCGTGGACA TGGGTCTGAT GCAGGAGCGT ATTACCACCA 750 

CCCGCAAGGG CTCAATTACC TCCGTCCAGG CCGTCTACGT GCCCGCTGAC 8 00 

GATCTGACGG ATCCCGCCCC CGCCACCACC TTCGCCCATC TGGACGCCAC 850 

50 CACTGTGCTG TCCCGAGGTA TCTCTGAGCT GGGTATCTAC CCCGCTGTCG 900 

ACCCCCTCGA CTCCAAGTCG CGTATGCTGG ACCCCCGTAT TGTCGGTGAC 950 

GACCACTACG AGACCGCCAC TCGCGTCCAG CAGATCCTCC AAGAGTACAA 10 00 

GTCGCTGCAG GACATCATCG CCATTCTGGG TATGGACGAG CTGTCTGAGG 1050 

CCGACAAGCT TACAGTCGAG CGTGCTCGTA AGATCCAGCG TTTCCTGAGC 1100 

55 CAGCCGTTCA CGGTCGCGCA GGTCTTCACT GGTATCGAAG GCCAGCTGGT 1150 

CGATCTGAAG GACACTATCG CTTCGTTCAA GGCTATCCTG AGCGGTGAGG 12 00 

GTGACAGCCT T 1211 



303 
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2) INFORMATION FOR SEQ ID NO: 4 94 



PCT/CAOO/01150 



10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1133 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stephanoascus ciferrii 

(B) STRAIN: ATCC 52550 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 94 



20 



25 



30 



35 



40 



TCTTGTTAGA 
CAGTTGGTAA 
ATTGATGAGC 
TGAGCCACCA 
CCGGTATTAA 
ATTGGTCTTT 
GCTTATTAAC 
GTGTCGGTGA 
GAGACCGGTG 
CGGTCAGATG 
GTTTGACCAT 
TTGTTCATTG 
TGCCTTGTTG 
CTACTGATAT 
TCCGTCACCT 
TCCTGCCCCA 
CCCGATCTAT 
TCCAAGTCCC 
CGTTGCCGCC 
ATATCATTGC 
ACTGTCGAGC 
CGTCGCTGAG 
ACACCATCCG 



GGCACCCCAG 
CGGCACTTTG 
GTGGACCAGT 
ACCTTCGCTG 
GGTTGTCGAC 
TCGGTGGTGC 
AACATTGCTA 
GCGAACTCGT 
TCATCAACCT 
AACGAGCCTC 
TGCCGAGTAC 
ACAACATTTT 
GGTCGTATCC 
GGGTGGTCTT 
CTGTCCAGGC 
GCTACCACCT 
CTCTGAGTTG 
GTCTTTTGGA 
AACGTCCAAC 
CATTTTGGGT 
GTGCTCGTAA 
GTCTTCACTG 
ATCCTTCAAG 



TCAAGGACAC 
GGCCGTATCG 
CAAGGCTGAC 
ACCAGTCCAC 
TTGCTTGCCC 
CGGTGTCGGT 
AGGCCCACGG 
GAAGGTAACG 
TGAGGGTGAC 
CAGGAGCCCG 
TTCAGAGATG 
CCGATTCACC 
CATCTGCCGT 
CAAGAACGTA 
TGTCTACGTC 
TCGCCCATTT 
GGTATCTACC 
TGCCTCCGTC 
AGACCTTGCA 
ATGGACGAAT 
GATGCAGAGA 
GTCTCGAGGG 
GAGATCCTTG 



TGGTGCTCCA 
TCAACGTTCT 
AAGTTCAGAC 
CTCTGCCGAG 
CTTATGCCAG 
AAGACTGTGT 
TGGTTACTCT 
ATTTGTACCA 
TCCAAGGTGT 
TGCCCGTGTT 
AGGAGGGCCA 
CAGGCCGGTT 
CGGTTACCAA 
TTACCACCAC 
CCAGCTGACG 
GGACGCCACC 
CAGCTGTCGA 
GTCGGCCAAG 
GGCCTACAAG 
TGTCTGAGGC 
TTCCTTTCTC 
TAGACTCGTT 
ACG 



ATTACCATTC 50 

CGGTGAGCCA 100 

CTATTCACGC 150 

GTTCTTGAGA 2 00 

AGGTGGTAAG 250 

TCATTCAGGA 3 00 

GTCTTCACTG 3 50 

CGAAATGATG 400 

CTCTTGTGTT 450 

GCCTTGACCG 500 

GGATGTCTTG 550 

CTGAGGTCTC 600 

CCAACCTTGG 650 

TCAAAAGGGT 700 

ATTTGACTGA 750 

ACCGAATTGT 800 

CCCTCTTGGT 850 

AGCACTACGA 900 

TCTCTCCAGG 950 

TGATAAGCTC 1000 

AGCCATTCAC 1050 

TCTTTGAAGG 1100 
1133 



45 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 4 95 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6 08 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: WSA-225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495 

GGAGTTGATT GTAAGTCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 
GCGAAAATTG GACAATTGAG CAATTTAGCC ATTGGAGAAA AGAAATTTCG 

304 



50 
100 
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WO 01/23604 

AGTATTAATT GTTTTTATAG 

TCTGTCTTCA CTGGTGTCGG 

CCATGAAATG CAGGAGACCC 

TCGCCCTGGT CTTCGGCCAG 

5 GTTGCTCTTA CTGGTTTGAC 

TCAAGACGGT GAGTTTCTTA 

TTTTTTTCAA GAAATTCATG 

TCATCGACAA CATTTTCCGT 

CTGCTTGGTC GTATCCCATC 

10 CGACATGGGT GGTATGCAGG 
TTACCTCC 



AACAACATTG CCAAGGCTCA 
AG AG CGT AC C CGTGAAGGAA 
GTGTCATTCA GCTTGATGGC 
ATGAACGAGC CCCCAGGTGC 
CATTGCTGAG TACTTCCGTG 
TGGATAAAAA AAAATTTTTT 
TTCTAACAAA GTGTATTCTA 
TTCACTCAGG CTGGTTCCGA 
TGCCGTCGGT TACCAACCCA 
AACGTATTAC CACCACCAAG 



PCT/CA00/01150 

CGGTGGTTA!C V5® 

ACGATCTCTA 2 00 

GAGTCCAAGG 250 

CCGTGCCCGT 300 

ATGAGGAAGG 350 

TTTTTTTTTT 40 0 

GTGCTTCTCT 4 50 

AGTGTCTGCC 500 

CTCTTGCCGT 550 

AAGGGATCCA 600 

608 



15 2) INFORMATION FOR SEQ ID NO: 496 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wangiella dermatitidis 

(B) STRAIN: WSA-229 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496 



GTTTATTCAA GAACTCATTG TGGGTGGCAT TCTCATAATG TTTCGGCCAC 50 

AATTACTGAT TGAAAATAGA ACAACATTGC AAAGGCTCAT GGTGGTTACT 100 

CCGTGTTCAC TGGTGTCGGC GAGCGAACTC GTGAGGGTAA CGACTTGTAC 150 

CACGAAATGC AGGAGACCTC TGTCATTCAG CTCGATGGCG AGTCCAAGGT 2 00 

3 5 CGCGCTGGTG TTTGGTCAAA TGAACGAACC TCCTGGTGCT CGTGCTCGTG 250 

TTGCTCTGAC TGGGTAAGTT GTTCCTTCGC TTCTTGCGCG TATCCACATC 3 00 

CCCATCTTGA GAATACGTCT GCCACCATGT CATGTGATGT TGGGCTGGTT 3 50 

CTGGTTTTTG GGAGGCCCTC AAGTTCAATT TTTGGATGAC AGCACCAGCT 4 00 

TTACAAGATT ATGCTAACTT AATGGAGTCT TACGGTGGCT GAGTTCTTCA 450 

4 0 GGGATGAGGA GGGACAGGAT GGTAAGTTTG ATAACAATCT CGTCGGTGTC 500 

AATATCGACG GCGTACTCTT CGCATCAAAA AACCAAAGAG GTGGTTTGGT 550 

GTGAGAAGTG CGCCGGAAAT AATGGCAACC ACGTGACAAT GACCACGTGT 600 

GGGGCTCCCG TGCTAACACG TGACAGTCTT GCTCTTCATC GACAACATTT 650 

TCCGATTCAC TCAASCCGGT TCTGARGTGT CTGCCTTGCT TGGTCGTATT 700 

45 CCATCTGCCG TCGGTTACCA ACCCACACTC GCCGTCGACA TGGGTCTCAT 750 

GCAGGAACGT ATCACCACCA CCCGGAAGGG ATCCATCACA TCTG 794 



50 2) INFORMATION FOR SEQ ID NO: 497 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 114 8 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

305 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(A) ORGANISM: Yarrowia lipolytica 

(B) STRAIN: ATCC 38295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497 

5 

TCTTGTCCGA GGCACCGCCG TCGCTGACAC CGGTGCTCCC ATCACTATCC 50 

CCGTCGGCCG AGGTACCCTT GGTCGAATCA TCAACGTCTG TGGTGAGCCC 100 

ATTGACGAGC GAGGACCCAT CGAGGCTTCC AAGTACCTCC CCATCCACGC 150 

TGACCCCCCT ACCTTCGCTG AGCAGTCTAC CTCCGCTGAG GTTCTCGAGA 2 00 

10 CCGGTATTAA GGTCGTCGAC CTCCTCGCCC CTTACGCCCG AGGTGGTAAG 250 

ATTGGTCTCT TCGGTGGTGC CGGTGTCGGT AAGACTGTCT TCATCCAGGA 3 00 

GCTGATTAAC AACATTGCCA AGGCCCATGG TGGTTTCTCC GTTTTCTGCG 3 50 

GTGTCGGTGA GCGAACCCGA GAGGGTAACG ATCTTTACCG AGAGATGAAG 4 00 

GAGACTGGTG TCATCAACCT CGAGGGTGAG TCTAAGGTCA CCCTCGTCTT 4 50 

15 CGGTCAGATG AACGAGCCTC CCGGAGCCCG TGCCCGAGTC GCCCTTACTG 500 

GTCTGACCAT TGCCGAGTAC TTCCGAGACG AGGAGGGTCA GGATGTGTTG 550 

CTCTTCGTTG ACAACATTTT CCGATTCACC CAGGCCGGTT CCGAGGTGTC 6 00 

CGCTCTGCTT GGTCGAATTC CCTCCGCTGT CGGTTACCAG CCCACTCTGG 650 

CCACCGATAT GGGTGCCCTC CAGGAGCGAA TTACCACCAC CCAGAAGGGT 700 

20 TCCGTCACTT CCGTCCAGGC CGTCTACGTG CCTGCCGATG ATTTGACCGA 750 

TCCTGCTCCC GCCACCACCT TCGCCCATCT TGACGCCACC ACCGTCCTGT 800 

CCCGAGGTAT TTCCGAGCTG GGTATCTACC CCGCTGTCGA TCCCCTTGAT 850 

TCCAAGTCTC GACTTCTGGA TATCGATGTT GTCGGAAAGG AGCACTACGA 900 

TGTTGCTTCC AACGTCCAGC AGACCCTCCA GGCTTACAAG TCTCTCCAGG 950 

25 ATATCATTGC CATTCTTGGT ATGGATGAGC TGTCCGAGCA GGACAAGCTG 1000 

ACCGTCGAGC GAGCTCGAAA GATCCAGCGA TTCCTGTCTC AGCCCTTCAC 1050 

CGTCGCCGAG GTTTTCACCG GTATTGAGGG ACGACTTGTC TCTCTCAAGG 1100 

ACACTGTCCG ATCCTTCAAG GAGATCCTTG ACGGTAAGCA CGATGCTC 1148 



30 



45 



2) INFORMATION FOR SEQ ID NO: 498 



(i) SEQUENCE CHARACTERISTICS : 
35 (A) LENGTH: 966 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: WSA-172 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 98 



GCGCTATTGT CGTTGTTGCT GCCTCCGACG GTCAGATGTA GGTGGAACAT 50 

CTTGGGAAAT ACGTCGTAAA ACACGTCGCT TACGTTTTCG CGAATAGGCC 100 

50 CCAGACTCGT GAGCATTTGC TGCTCGCCCG CCAGGTTGGT GTCCAGAAGA 150 

TCGTTGTCTT CGTCAACAAA ATCGATGCTA TTGATGATCC GGAGATGCTG 200 

GAACTGGTCG AACTCGAGAT GCGTGAGCTG CTGAACAGCT ACGGTTTCGA 250 

GGGTGAAGAG ACTCCGATCA TTTTCGGTTC CGCTCTCTGT GCTCTCGAAG 3 00 

GACGCCGTGA CGACATCGGT AAAGACAGAA TTGAGCAGCT TATGAACGCT 350 

55 GTCGACACCT GGATCCCCAC TCCTCAGCGT GACCTCGACA AACCTTTCTT 4 00 

GATGTCTGTC GAGGAAGTGT TCTCTATCGC CGGCCGTGGT ACCGTGGCTT 450 

CTGGTCGTGT CGAGCGTGGT ATCTTGAAGA AGGACTCTGA GGTTGAGATT 500 

GTTGGAGGCT CCTTCGAACC CAAGAAGACC AAAGTCACCG ACATTGAAAC 550 

CTTCAAGAAG AGCTGTGATG AATCGCGTGC TGGTGACAAC TCTGGTCTCC 600 

60 TCCTGCGTGG TATCCGACGT GAAGACGTCA AGCGTGGTAT GGTCATTGCT 650 

306 
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GTTCCCGGCA GCACCAAGGC TCACGACAAG TTCCTCGTCT CCATGTACGT 7 00 

CCTGACCGAG GCGGAGGGTG GTCGTCGTAC TGGCTTCGGT GCCAACTACC 750 

GTCCCCAAGT CTTCATCCGT ACTGCAGGTA AGTTCCCGCA CACCGTGTCC 8 00 

AGATCTTCCG AGAGATTAGC GATATATGCT AATGATTCAT CAGACGAGGC 850 

5 TGCTGACCTC AGCTTCCCTG ACGGCGACCA ATCTCGCAGA GTTATGCCTG 900 

GTGACAACGT CGAGATGATC CTGAAGACCC ACCACCCTGT TGCTGCTGAG 950 

GCTGGTCAAC GCTTCA 966 

10 

2) INFORMATION FOR SEQ ID NO: 499 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 846 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastoschizomyces capita. tus 

(B) STRAIN: ATCC 10663 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499 

TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCCCAAACCC 50 

GTGAGCACTT GCTTCTTGCT CGTCAAGTTG GTGTTAAGCA CATTGTTGTT 100 

TTCGTTAACA AGATTGATAC TATTGATGAT CCTGAAATGT TGGAACTTGT 150 

3 0 CGAAATGGAA ATGAGAGAAC TTCTTTCTTC TTACGGTTTT GATGGCGATA 2 00 

ACACCCCTGT CATTATGGGT TCTGCTCTCT GTGCTC TTGA AGGTCGTGAA 2 50 

CCAGAAATTG GTGAACAAAG AATCAACCAA CTCCTTGATG CTATCGATGA 3 00 

ATACATTCCT ACCCCAGTTC GTGATATGGA CCAACCTTTC TTGATGCCAC 350 

TTGAAGGTGT TTTCTCTATT CCAGGTCGTG GTACTGTTGC CACTGGACGT 4 00 

35 GTCTATCGTG GTACTTTGAA GAGAGGTGAA GAAGTTGAAG TTGTTGGCTA 4 50 

CAATGATGCT CCAATCAAGA CCACCGTTAC TGGTATTGAA ATGTTCAAGA 500 

AGGAACTTGA TCAAGCTCAA GCTGGTGACA ACGCTGGTAT TCTTTTGAGA 550 

GGTGTTAAGC GTGAAGACCT TAAGCGTGGT ATGGTTGTTG CTAAACCAGG 600 

TACCGTTAAG CCACACACCA AGTTCCTTGC CTCCATCTAT GTTTTGACTA 650 

4 0 AGGAAGAAGG TGGCAGACAC TCTGGCTTTG GTCTTAACTA CAGACCTCAA 700 

CTTTTCCTTG GTTCTGCTGA TGTTACCACT GTCTTGACCT TCCCAGAGGG 750 
TGTTGACCAA AGCACTCAAG TCATGCCAGG TGACAACACT GAAATGGTTT 800 
GCGAACTTGT TCACCCAGTT GCTGTCGAAC AAGGCCAACG TTTCAA 84 6 

45 

2) INFORMATION FOR SEQ ID NO: 500 

( i ) SEQUENCE CHARACTERISTICS : 
50 (A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida rugosa 

(B) STRAIN: ATCC 96275 



60 



307 
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WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500 



PCT/CA00/01150 



GGTGCCATTA TTGTTGTTGC TGCCTCTGAT 
TGAGCATCTT TTGCTTGCCC GCCAAGTCGG 
5 TTGTTAACAA GATTGATACC ATTGATGACC 
GAGATGGAGA TGCGTGAACT GTTGAATGAA 
CTCTCCTGTC ATTATGGGCT CTGCTCTTGC 
CCGAGATTGG TAAGGACCGT ATCATGCAGC 
TGGATCCCTA CCCCCGAGCG TGACCTTGAC 

10 TGAGGCCTCT TTCTCCATTT CTGGTCGTGG 
TCGAGCGTGG TATTCTCAAG AAGGGTGAGG 
AACAAGCAGC CCCTGAAATC TGTTGTTACT 
GGAACTTGAT CAGGCCCAGG GCGGTGATAA 
GTATTCGTCG TGAGGACTTG CAGCGTGGTA 

15 ACTGTTAAGG CTCACACCAA GTTCCTTTCC 
GGAAGAGGGC GGCCGTCACT CTCCTTTCGG 
TGTTCGTTTC TGCAGCTGAT GTCACCGTTA 
GTTGAACAGC ACACTCAGGT CTTCCCTGGT 
CGAGCTCGTT CACCCTACTG CTATTGAGGT 

20 



2 ) INFORMATION FOR SEQ ID NO: 501 

25 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 944 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides xwmitis 
35 (B) STRAIN: Silveira 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501 

AGTTGTCGTC GTTGCTGCTT CAGACGGTCA AATGTATGCA ACCGAGAGCA 50 

4 0 CTCCCGGATC TTGGTTTAAA TGGCACTAAT ATAAGACAGG CCTCAAACTC 100 

GAGAGCATTT ACTTCTCGCC CGTCAGATCG GTATCCAAAA AATCGTCGTC 150 

TTCGTGAACA AGGTTGATGC CATCGAGGAC AAAGAGATGT TGGAGCTTGT 2 00 

TGAATTGGAG ATGCGTGAAC TCCTAACCAG CTACGGTTTC GAGGGTGAAG 2 50 

AAACTCCCAT CATTTTTGGC TCTGCTCTCT GTGCCCTCGA AGGAAGACAA 300 

4 5 CCCGAGATCG GTGTTACCAA GATTGATGAG CTCTTGCAGG CCGTCGACAC 3 50 

CTGGATTCCC ACTCCTCAGC GTGAGACTGA CAAGCCCTTC TTGATGTCCA 4 00 

TTGAGGAAGT GTTCTCTATT TCCGGACGAG GAACCGTTGT CTCCGGCCGT 4 50 

GTGGAGCGTG GTATCCTCAA GAAGGACTCC GAAGTTGAAA TTGTCGGCGG 500 

TTCGCCCGAG CCAATCAAAA CCAAGGTTAC CGATATCGAA ACCTTTAAGA 550 

50 AGTCTTGCGA CGAGTCTCGC GCTGGTGATA ACTCCGGCTT GCTCCTACGA 600 

GGCGTTAAGC GTGAAGATAT TAGCCGTGGC ATGGTCGTCG CTGTACCAGG 650 

AAGTGTCAAG GCCCATACTG AATTCTTAGT TTCGCTTTAC GTCCTCACCG 7 00 

AAGCTGAGGG TGGGCGCAAA TCTGGATTCA GCAGCAAGTA CCGCCCACAG 750 

ATGTTCATTC GCACTGCCGG TATGTAATAC TGTGATAATT TCGTTGACAT 800 

55 GGTACTGATT GAATTCTATA GACGAAGCGG CTCAGCTCAG CTGGCCCGGA 850 

GAAGATCAAG ACAAGATGGC TATGCCAGGA GACAATATCG AAATGATTTG 900 

CACCACCTTG CACCCAGTTG CCGCCGAGGC TGGCCAGCGA TTCA 944 

60 

308 

SUBSTITUTE SHEET (RULE 26) 



GGACAGATGC CCCAGACCCG 50 

TATGCAAAAG GTCGTTGTGT 100 

CTGAAATGCT TGATCTTGTC 150 

TATGACTTCG ATGGAGATAA 2 00 

TGCTCTTGAG GACAAGAACC 2 50 

TCTTGGACGC TGTTGATGAA 3 00 

AAGCCTTTCA TGATGCCTAT 3 50 

TACTGTTGCC ACTGGCCGTG 4 00 

AAGTCGAGAT CGTTGGTTTC 4 50 

GGTATTGAAA TGTTCAAGAA 500 

TGCTGGTATC TTGCTTCGTG 550 

TGGTTTTGGC CAAGCCTGGA 600 

TCCATCTACG TTCTCTCCAA 650 

TATGAACTAT CGTCCCCAGA 70 0 

CTCTTACTTT CCCTGAGGGT 750 

GAGAACACCG AGATGGTTGG 800 

TGGTCAACGC TTCAAC 84 6 



WO 01/23604 
2) INFORMATION FOR SEQ ID NO: 502 



PCT/CA00/01150 



10 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 84 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium oxysporum 

(B) STRAIN: WSA-212 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502 

GCTATCATCG TCGTTGCTGC CTCCGATGGA CAGATGCCCC AGACCCGTGA 50 

ACACTTGCTC CTCGCTCGTC AGGTCGGTGT CCAGCGAATT GTCGTCTTCG 100 

TCAACAAGGT CGATGCCATT GATGACCCCG AGATGCTTGA GCTCGTCGAG 150 

2 0 ATGGAGATGC GCGAGCTTCT TAACACCTAC GGCTTCGAAG GCGACGACAC 200 

TCCCGTCATC ATGGGCTCTG CTCTTATGTC TCTCCAGAAC CAGCGCCCCG 250 

AGATTGGCAC CGAGAAGATC GATGAGCTCC TTGCTGCCGT CGACGAGTGG 300 

ATCCCAACCC CCGAGCGTGA CCTTGACAAG CCCTTCCTTA TGTCCGTCGA 350 

GGATGTCTTC TCCATTGCCG GCCGTGGTAC CGTCGTGTCT GGCCGTGTGG 400 

2 5 AGCGTGGTAT TCTGAAGCGT GATCAGGAGA TCGAGCTTGT CGGAAAGGGG 4 50 

CAGGAGGTTA TCAAGACCAA GGTTACCGAC ATTGAGACCT TCAAGAAGTC 50 0 

TTGTGAGCAG TCCCAGGCTG GTGACAACTC TGGTCTCCTC ATCCGAGGTG 550 

TTCGCCGTGA GGATGTC CGC CGTGGTATGG TTGTCTGCGC TCCTGGCACC 600 

GTCAAGTCTC ATACCCAGTT TCTCGCTTCC CTCTACGTCC TCACCAAGGA 650 

30 GGAGGGTGGC CGACACACCG GTTTCCAGGA GCACTACCGA CCCCAGCTCT 700 

ATCTCCGAAC TGCAGATGAG TCCATTGACC TGACTTTCCC CGAGGGTACT 750 

GAGGATGCCT CCAGTAAGAT GGTCATGCCT GGTGACAACA CCGAGATGGT 800 

TGTCACCATG GGTCACCCCA ATGCCATCGA GGTTGGTCAG CGATTCAAC 84 9 



35 



50 



2) INFORMATION FOR SEQ ID NO: 503 



( i ) SEQUENCE CHARACTERISTICS : 
4 0 (A) LENGTH: 1064 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulation 

(B) STRAIN: G186A5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503 



TGGTGCTATT GTCGTTGTTG CTGCTGCTGA CGGCCAAATG TAAGACGCCG 50 

CGAGGGACTG CTGAGGGTTT TATGCTTTTT AGGCCCCCTT GTTTCTGAGA 100 

55 GCATGATGAT ACTAATATTC GGAAACGTAT CTATTAGGCC TCAAACACGT 150 

GAGCATTTGC TCCTTGCCCG ACAGGTCGGT GTCCAAAAGA TCGTCGTTTT 200 

CGTGAACAAA GTCGACGCCC TTGAGGACAA GGAGATGTTG GAGCTTGTCG 250 

AGTTAGAAAT GAGAGAGCTC TTAAACACCT ACGGCTTCGA GGGTGAAGAG 300 

ACACCCATCA TCTTTGGTTC TGCCCTTTGC GCCATGGAAG GCCGTGAGCC 350 

60 TGAGTTGGGA GAAAAGAAAA TTGATGAATT GCTGGAGGCT GTTGATACTT 4 00 

309 
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GGATCCCCAC ACCACAACGT GATACCGAAA AACCTTTCTT GATGTCCGTT1 4 f 5© 
GAGGAAGTAT TCTCTATCTC CGGCCGTGGA ACCGTTGCCT CCGGTCGTGT 500 
TGAGCGCGGT GTCCTCAAGA AGGATTCAGA AGTCGAGCTA ATTGGGGGCG 550 
GCTCCACCCC CATCAGGACG AAGGTAACTG ATATCGAAAC TTTCAAGAAA 600 
5 TCCTGTGACG AGTCTAGAGC TGGGGACAAC TCCGGTCTTT TATTGCGTGG 650 
TATCAAGCGT GAAGATATCC GCCGTGGTAT GGTAGTTGCC GTTCCTGGCA 70 0 

GCGTCAAGGC CCACGACAAG TTCTTGGTGT CGATGTATGT CCTGACCGAA 750 
GCTGAGGGTG GTCGCCGAAC CGGATTCGGC CAGAACTATC GTCCTCAAAT 8 00 

GTTCATCCGC ACAGCTGGTA TGTCAAAATG GGACCCCTTT TCATAATCCT 850 
10 TTCTTTTTTT CCTTTTCCTC TCTATCTCTC TTTCTGTTTC CTTTCAACTC 900 
GCCTGATTCA CGAAATTAAC TAACCCGTTT GATTATAGAC GAAGCCGCCC 950 

ATCTCAGCTT CCCTAGTGGA GCAGATGAAA GCAAACTCGT TATGCCTGGT 1000 

GACAACGTCG AGATGATCCT CCAGACACAC CGCCCCGTGG CTGCTGAGGC 1050 

CGGCCAGCGA TTCA 10 64 

15 



2) INFORMATION FOR SEQ ID NO: 504 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 
30 (B) STRAIN: ATCC 32071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504 

TGGTGCTATC GTCGTTGTTG CTGCCTCTGA CGGCCAAATG TAGGGATTTT 50 

3 5 GCAAGACTGG TGAAAAAATC TAAAGAAAAT AGAAAAGATT GTGCTGATGT 100 

TTGGTATCAG GCCCCAAACA CGCGAGCATC TGCTTCTTGC CAGACAAGTC 150 

GGTGTTCAGA AAATCGTTGT TTTCGTCAAC AAGGTCGATG CTGTAGAGGA 2 00 

TAAGGAGATG TTGGAGCTTG TCGAATTGGA GATGAGAGAG CTCTTGACCA 2 50 

CCTATGGGTT CGAGGGTGAG AAGACACCTA TCATCTTTGG TTCTGCGCTC 3 00 

4 0 TGTGCTATGG AGGGCCGTCA GCCCGAGTTG GGAGAGCAGA AAATTGATGA 3 50 

ATTACTCGAG GCTGTGGATA CTTGGATCCC TACGCCACAG CGTGATACTG 4 00 

ACAAGCCCTT CCTGATGTCC ATTGAGGAGG TGTTCTCTAT CTCTGGACGA 4 50 

GGAACCGTTG CCTCCGGCCG CGTTGAGCGT GGTATCCTCA AGAAGGACTC 500 

CGAAGTTGAA ATTATTGGCG GCGGTGTTCC CACAATCCTG ACCAAGGTGA 550 

4 5 CTGATATCGA AACCTTCAAG AAGTCTTGCG ACGAGTCCAG AGCCGGGGAC 6 00 

AACTCCGGCC TCTTGTTGCG CGGTGTCAAG CGTGAGGATA TCCGCCGTGG 650 

TATGGTCGTT GCAGTTCCCG GAAGCGTCAA AGCACATGAC AGATTCTTGG 700 

TGTCGATGTA CGTTCTGACC GAGGCTGAGG GTGGTCGCCG CACTGGCTTC 750 

GGTCAGAACT ATCGTCCTCA AATGTTCATC CGCACAGCTG GTACGTTCAT 800 

50 TCTTTCACTA TATTCCTATA TGCATAGCCC GATCCTCCCA TTAACTAATT 8 50 

GACACAGACG AGGCTGCTGA ACTCAGCTGG CCTGATGGAG ACGACGAAGC 900 

CAAAATGGTC ATGCCCGGTG ACAATGTTGA AATGGTCCTG AAGTCACACC 950 

GCCCGGGGCC GCTGAGGCTG GACAGCGATT CA 982 



2 ) INFORMATION FOR SEQ ID NO: 505 



(i) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 931 bases 

310 
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10 



15 



20 



25 



30 



WO 01/23604 

(B) 
(C) 
(D) 



PCT/CA00/01150 



TYPE: Nucleic acid 
STRANDEDNESS : Double 
TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 505 



CGCTGTTGTT 
AGCTGCCAAT 
CACTTGCTCC 
CAACAAGGTT 
TGGAAATGCG 
CCTATCATTT 
GATTGGCGAA 
TCCCCACCCC 
GAAGTTTTCT 
GCGTGGTATT 
AGAACCCTAT 
TGCGATGAAT 
CAAGCGTGAG 
CCAAGGCTCA 
GAAGGTGGTC 
CATCCGTACT 
GCTAACGAGT 
CAGTCCAAGC 
ACACCGTCCC 



GTCGTCGCTG 
TATGGACACT 
TCGCCCGTCA 
GATGCCGTCG 
TGAACTCTTG 
TCGGATCCGC 
CAGAAGATTG 
CCAGCGTGAC 
CCATCTCTGG 
TTGCGCAAGG 
CAAGACCAAG 
CTCGTGCTGG 
GACATTCGTC 
TGACAACTTC 
GTCGTACTGG 
GCCGGTATGT 
TATAGATGAG 
AGGTCATGCC 
GTTGCCGCCG 



CTTCTGATGG 
GCTGATAAGA 
GGTCGGTGTT 
AGGACCCCGA 
ACCACCTACG 
TCTTTGCGCC 
ACGAGCTCAT 
CTTGACAAGC 
TCGTGGTACC 
ATTCTGAGGT 
GTTACCGACA 
TGACAACTCT 
GTGGTATGGT 
TTGGTCTCCA 
ATTCGGCGCC 
TCCCTTTCAA 
GCTGCTACTC 
CGGTGACAAC 
AAGCTGGTCA 



TCAAATGTAA 
ATAGGCCCCA 
CAAAAGATCG 
GATGTTGGAA 
GTTTCGAGGG 
TTGGAAGGCC 
GAACGCCGTT 
CCTTCTTGAT 
GTTGCATCTG 
TGAGATTATC 
TTGAGACCTT 
GGCTTGCTTC 
TATCGCTGCT 
TGTATGTCTT 
AACTACCGTC 
AGTCAATTAA 
TCAGCTTCCC 
GTTGAGATGA 
G 



CATATCCACG 50 

AACCCGTGAG 100 

TCGTCTTCGT 150 

CTTGTCGAAT 2 00 

TGAAGAGACC 250 

GCAAGCCCGA 3 00 

GATACCTGGA 350 

GTCCGTTGAG 4 00 

GTCGTGTTGA 450 

GGATACCAGA 500 

CAAGAAGTCT 550 

TCCGTGGTAT 600 

CCTGGAACCA 650 

GACTGAGGCT 700 

CTCAAGCTTT 750 

TGAGCGATTT 800 

CGGTGACGAT 850 

TCTTGAAGAC 900 

931 



35 2) INFORMATION FOR SEQ ID NO: 506 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pichia anomala 

(B) STRAIN: ATCC 182 05 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506 



TGGTGCTATT ATTGTTGTTG CTGCTTCTGA TGGTCAAATG CCTCAAACCA 50 

GAGAACATTT RTTATTGGCT AGACAAGTTG GTGTTCAACA CATTGTTGTC 100 

TTTGTTAACA AAGTTGATAC TATTGATGAC CCAGAAATGT TGGAATTAGT 150 

TGAAATGGAA ATGAGAGAAT TGTTAAGTAC TTATGGTTTT GATGGTGATA 2 00 

55 ACGTCCCAGT TGTTATGGGT TCTGCTTTAT GTGCCTTGGA AGGTCGTGAA 2 50 

GAAGAAATTG GTGTCAAAGC TATTGATAAA TTATTAGCTG CTGTTGATGA 3 00 

ATATATCCCA ACCCCACAAA GAGATTTAGA AAAACCATTC TTGATGGGTG 3 50 

TTGAAGATGT CTTYTCAATC TCAGGTAGAG GTAC CGTTGT TACTGGTCGT 4 00 

GTTGAACGTG GTAACTTGAA GAAAGGTGAT GAAGTTGAAA TTGTTGGTTT 450 

60 AAACAAAACT CCATTGAAAA CTACTGTYAC NGGTATTGAA ATGTTCAAAA 500 
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AAGAATTGGA CGAAGCTATG GCTGGTGATA ACTGTGGTAT CTTATTACGT 550 

GGTATCAAAA GAGATGACAT YAAAAGAGGT ATGGTTATTG CTAAAACCGG 600 

TACCATCTCW GCTCACACTA AATTCTTAGC CTCAATGTAT ATTTTGACTA 650 

AAGAAGAAGG TGGTCGTCAC TCAGGTTTTG GTGAACATTA CAGACCTCAA 700 

TTATTCATCA GAACTGGTGA TGTTACCGTT GTTTTAACCT TYCCAGAAGG 75 0 

TGGTGATTCA TCTCAACAAA TCTTACCAGG TGACAATGTC GAAATGGTTT 8 00 

GTGAATTGGT TCACCCAACT GCTTTAGAAG CTGGTCAAAG ATTCAA 84 6 



2) INFORMATION FOR SEQ ID NO: 507 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 964 bases 
15 <B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: WSA-225 

2 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 507 

GGAGCTGTTG TCGTTGTCGC AGCTTCTGAC GGTCAAATGT AATTGAATGC 50 

CCGCCCAGAC GGATGAAAGG ATTTGACGTT TCTAACATCA GTCTAGGCCT 100 

CAGACCAGAG AACATTTGCT CCTTGCCCGC CAGGTCGGTG TCCAGAAGCT 150 

3 0 GGTCGTTTTC GTTAACAAGG TCGATGCCGT TGAGGACCCA GAGATGTTGG 2 00 

AGCTTGTCGA ACTTGAAATG CGTGAACTCC TCAGCCACTA CAGTTTTGAG 250 

GGTGAGGAGA CCCCCATCAT TTTTGGCTCT GCTCTCTGTG CCCTCGAGTC 3 00 

CCGTCGACCT GAGCTTGGTG TCGAGAAGAT TGACGAGCTA TTGAACGCCG 350 

TCGACACCTG GATCCCCACC CCCGAGCGCG CCACTGATAA GCCTTTCCTC 4 00 

35 ATGTCCATTG AGGAAGTGTT CTCTATCTCT GGTCGTGGTA CCGTCGTCTC 450 

CGGTCGTGTT GAGCGTGGTA TCCTCAAGAA GGATTCCGAC GTCGAAATTG 50 0 

TTGGTGGCTC TACCACCCCT ATCAAGACCA AGGTCACAGA TATCGAAACC 550 

TTCAAGAAGT CCTGCGATGA ATCTCGAGCT GGTGACAACT CTGGTCTCCT 60 0 

TCTCCGAGGT ATCAAGCGTG AGGACTTGAA GCGTGGAATG GTTGTTGCTG 6 50 

4 0 CCCCCGGATC CACCAAGGCT CACACCGACT TCATGGTCTC CCTCTACGTC 7 00 

CTGACTGAGG CTGAGGGTGG TCGTTCCAAC GGCTTCACCC ACAAGTACCG 750 

CCCTCAAATG TTCATCCGTA CTGCTGGTAT GTAACCAAAG TTTCCGCTAT 800 

TTACTAAGTA GATCATTGCT AACTTGTATT CCCTTCCGTA GACGAAGCCG 850 

CATCTTTCAG CTGGCCTGGA GAAGACCAAG ACAAGAAGGC TATGCCTGGT 900 

4 5 GACAACGTCG AGATGATTTG CAAAACCCTC CACCCCATTG CTGCCGAGGC 950 

TGGCCAACGA TTCA 964 



50 2) INFORMATION FOR SEQ ID NO: 508 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

312 
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PCT/CA00/01150 



(A) ORGANISM: Yarrowia lipolytica 

(B) STRAIN: ATCC 38295 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508 

GGTGCTATCA TTGTCGTTGC TGCTGGAGAC GGTTCCAAGC CCCAGACCCG 50 

AGAGCATCTG CTGCTTGCTC GACAGGTCGG TGTCCAGAAC CTGGTTGTGT 100 

TTGTTAACAA GGTTGATCAG ATTGATGATA AGGAGATTCT TGAGCTCGTT 150 

GACATGGAGA TGCGAGATCT GCTGACCCAG TACGGTTTTG ATGGTGACAA 2 00 

CACCCCCGTT GTCATGGGCT CTGCTCTGTG CGCTCTTGAG GGCAAGCAGA 250 

AGGATATTGG AGAGGACGCC ATCATGGCCC TTATGGATGC CGTTGATGAG 3 00 

CACATCCCTA CCCCTAACCG TGACCTTGAG AAGCCCTTCC TGATGCCCGT 3 50 

TGAGGACGTT TTCTCCATCT CTGGCCGAGG AACTGTTGTT ACTGGCCGAG 4 00 

TCGAGCGAGG AAACCTGAAG AAGGGTGAGG AAATCGAGAT TGTTGGCTAC 4 50 

AACAACAAGC CCATCAAGGC TGTTGTTACC GGTATTGAGA TGTTCAAGAA 50 0 

GGAGCTCGAG TCCGCCATGG CCGGTGACAA CGCCGGTATC CTGCTCCGAG 550 

GTATCAAGCG AGACGAGATC AAGCGAGGTA TGGTCATGTG CAAGCCTGGC 600 

ACCGTCAACG CCCACACCAA GTTCCTTGCT TCTCTTTACA TCATCCCCAC 650 

CGAGGAGGGT GGTCGAACCA GCTCTTTCGG CGCCAACTAC CGACCCCAGA 700 

TGTTCATCCG AACTTCTTCC GTCACCGCCA CTCTCACCTT CCCCGAGGGT 750 

ACCGACGAGT CCCAGACCGT CAACCCCGGT GACAACACTG AGATGGTTCT 8 00 

CGAGCTTGTT CACCCTACCG CCATTGAGGT CAACCAGCGA TTCA 844 



2) INFORMATION FOR SEQ ID NO: 509 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1067 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bigemina 

(B) STRAIN: Suarez-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509 

CTTGGACAAG CTGAAGAGCG AGCGTGAGCG TGGTATCACC ATTGACATTA 50 

CCCTGTGGAA GTTCGAAACT GGCAAGTACT ACTACACCGT CATTGACGCC 100 

CCCGGTCACC GTGACTTCAT TAAGAACATG ATTACGGGTA CCTCCCAGGC 150 

CGATGTTGCT ATGCTTGTCG TGCCCGCCGA GGCTGGTGGT TTCGAAGCTG 200 

CCTTCTCTAA GGAAGGTCAG ACCCGTGAGC ACGCTCTTTT GGCCTTCACC 250 

CTTGGTGTCA AGCAGATCAT TTGCGCCATC AACAAGATGG ACAAGTGCGA 3 00 

CTACAAGGAG GACCGTTACA GCGAAATCCA GAAGGAAGTT CAGGGTTACC 3 50 

TGAAGAAGGT CGGTTACAAC ATCGAGAAGG TGCCTTTCGT CGCCATCTCC 400 

GGTTTCATGG GTGACAACAT GGTTGAGCGC TCCACCAACA TGCCGTGGTA 450 

CAAGGGCAAG ACCTTGGTCG AGGCCCTCGA CATGATGGAG CCCCCGAAGA 500 

GGCCCGTCGA CAAGCCCCTG CGTCTTCCCC TCCAGGGTGT GTACAAGATC 550 

GGTGGTATCG GTACCGTCCC TGTCGGTCGT GTGGAGACTG GTCAGCTCAA 600 

GGCCGGTATG GTCCTCACCT TCGCCCCCAA CCCGATCACT ACTGAGTGCA 65 0 

AATCCGTCGA AATGCACCAC GAAGTTATCG ATGTTGCCAG CCCTGGTGAC 700 

AACGTTGGTT TCAACGTGAA GAACGTGTCC ACCTCTGACA TCCGCACTGG 750 

TCACGTCGCT TCTGACTCCA AGAACGACCC CGCCAAGGCC GCCGTGTCCT 8 00 

TCACCGCCCA GGTCATCATC TTGAACCACC CTGGTACCAT CAAGGCCGGT 850 

TACTCCCCTG TGGTTGACTG CCACACTGCC CACATCTCGT GCAAATTCGA 900 

CGAGATCACC AGCCGTATGG ACAAGCGTAC CGGTAAGGCC CTTGAGGAGA 950 
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ACCCCAAGAC CATCAAGAAC GGCGACGCCG 
TGCAAGCCCA TGGTCGTCGA GGCCTTCACT 
TTTCGCCGTG CGTGACG 



PCT/CA00/01150 

CTATGGTCGT CCTGAAGCCG 1000 
GAATACGCTC CCCTTGGTCG 1050 

1067 



2 ) INFORMATION FOR SEQ ID NO: 510 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1049 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510 

GTGAACGTGA ACGTGGTATT ACTATTGATA TTACCTTATG GAAGTTCGAG 50 

ACCACCAAGT ACTACTACAC CGTCATTGAT GCCCCTGGTC ACCGTGACTT 100 

CATCAAGAAC ATGATTACGG GTACTTCTCA AGCCGATGTT GCTATGCTTG 150 

TTGTACCAGC TGAGGCTGGT GGTTTCGAGG CCGCTTTCTC CAAGGAAGGA 2 00 

CAGACCCGTG AGCACGCTCT TTTGGCTTTC ACCCTTGGTG TCAAACAGAT 2 50 

CATCTGTGCC ATTAACAAGA TGGACAAGTG CGACTACAAG GAGGACCGTT 3 00 

ACAGTGAAAT CCAGAAGGAA GTCCAGGGTT ACCTCAAGAA GGTCGGTTAC 3 50 

AATATTGAGA AGGTGCCCTT CGTTGCCATC TCCGGTTTCA TGGGAGACAA 4 00 

CATGGTTGAG CGTTCCACCA ACATGCCCTG GTATAAGGGA AAGACATTGG 450 

TCGAGGCCCT TGATCAGATG GAACCCCCAA AGAGGCCCGT TGACAAGCGA 500 

CTTCGTCTTC CCCTCCAGGG TGTCTACAAG ATCGGTGGTA TCGGTACCGT 550 

CCCCGTCGGT CGTGTTGAAA CTGGTATGTT GAAGGCTGGT ATGATTCTAA 600 

CCTTTGCTCC TAACCCAATC ACCACTGAAT GCAAATCCGT TGAAATGCAC 650 

CACGAAACCG TTGAGGTTGC TTACCCCGGT GACAACGTCG GTTTCAACGT 700 
AAAGAACGTT TCTACTTCTG ACATTCGCAG TGGTCACGTT GCCTCTGATT v 750 

CTAAGAACGA CCCTGCCAAG GCTGCTGTTT CCTTCACTGC CCAGGTCATT 8 00 

GTGCTCAACC ACCCTGGTAC CATTAAGGCC GGTTACTGCC CCGTCGTCGA 850 

TTGCCACACC GCTCACATTT CATGTAAATT CGAAGAGATC ACCAGCCGTA 900 

TGGACAAGCG TACCGGTAAA TCTCTTGAGG AAAACCCCAA GACCATCAAG 950 

AACGGTGACG CTGCCATGGT TGTGCTCAAG CCAATGAAGC CCATGGTTGT 10 00 

CGAATCCTTC ACTGAGTATG CTCCTCTTGG TCGTTTCGCT GTTCGTGAC 104 9 



2) INFORMATION FOR SEQ ID NO: 511 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1070 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Crithidia fasciculata 

(B) STRAIN: ATCC 11745 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 511 

TGGACAAGCT GAAGGCGGAG CGCGAGCGCG GTATCACGAT CGATATTGCC 50~ 

CTGTGGAAGT TCGAGTCGCC CAAGTCCGTG TTCACGATCA TCGATGCCCC 100 

5 CGGCCACCGC GACTTCATCA AGAACATGAT CACCGGCACC TCCCAGGCCG 150 

ATGCCGCCAT TCTGATGATT GACTCGACCC AGGGTGGCTT CGAGGCTGGC 2 00 

ATCTCCAAGG ACGGCCAGAC CCGCGAGCAC GCCCTGCTTG CCTTCACGCT 2 50 

GGGCGTGAAG CAGATGGTTG TGTGCTGCAA CAAGATGGAC GACAAGACGG 3 00 

TGCAGTACGC CCAGGCCCGC TACGAGGAGA TCAGCAAGGA GGTCGGCGCG 3 50 

10 TACCTGAAGC GCGTGGGCTA CAACCCGGAG AAGGTGCGCT TCATCCCGAT 4 00 

CTCGGGCTGG CAGGGCGACA ACATGATCGA GAAGTCCGAC AACATGTCGT 450 

GGTACAAGGG TCCCACGCTG CTGGAGGCGC TCGACCTGCT GGAGGCCCCC 500 

GTGCGTCCGG TGGACAAGCC GCTGCGCCTG CCCCTGCAGG ACGTGTACAA 550 

GATCGGCGGT ATCGGCACTG TGCCCGTGGG CCGTGTGGAG ACCGGCGTGA 600 

15 TGAAGCCGGG CGACGTTGTG GTGTTCGCGC CTGCCAACGT GACGACCGAG 650 

GTGAAGTCGA TCGAGATGCA CCACGAGCAG CTGGCTGAGG CCGTGCCCGG 700 

CGACAACGTG GGCTTCAACG TGAAGAACGT GTCCGTGAAG GATATCCGCC 750 

GTGGTAACGT GTGCGGCAAC ACGAAGAGCG ACCCCCCGAA GGAGGCGGCC 800 

GACTTCACTG CCCAGGTGAT CGTGCTGAAC CACCCCGGCC AGATCAGCAA 850 

20 CGGCTACGCG CCGGTGCTGG ACTGCCACAC GAGC CACATC GCGTGCAAGT 900 

TCGCGGACAT CGAGTCCAAG ATCGACCGCC GCT CTGGCAA GGAGCTGGAG 950 

AAGAGCCCGA AGGCCATCAA GTCCGGCGAT GCGGCCATCG TGAAGATGAT 1000 

CCCGCAGAAG CCGATGTGCG TGGAGGTGTT CAACGACTAC CCGCCGCTGG 1050 

GCCGCTTCGC TGTCCGCGAT 1070 

25 

2) INFORMATION FOR SEQ ID NO: 512 

30 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1052 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Entamoeba histolytica 
40 (B) STRAIN: HM1-IMMS 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512 

GCTGAAAGAG AAAGAGGAAT TACTATTGAT ATTTCATTAT GGAAATTCGA 50 

45 AACATCTAAA TACTACTTCA CTATTATTGA TGCCCCAGGT CAYAGAGATT 100 

TCATTAAGAA CATGATTACT GGAACTTCAC AAGCTGATGT TGCCATCCTT 150 

ATTGTTGCTG CTGGTACTGG WGAATTTGAA GCTGGTATTT CAAAGAATGG 2 00 

ACAAACCAGA GAACAYATTC TTCTTTCATA CACTCTTGGA GTTAAACAAA 2 50 

TGATTGTTGG WGTTAACAAG ATGGATGCTA TTCAATATAA ACAAGAAAGA 3 00 

50 TATGAAGAAA TTAAGAAAGA AATTAGTGCA TTCCTTAAGA AGACWGGATA 350 

TAATCCAGAC AAGATTCCAT TTGTCCCAAT TTCAGGATTC CAAGGAGATA 400 

ATATGATTGA ACCATCAACC AACATGCCAT GGTACAAAGG ACCAACATTA 4 50 

ATTGGAGCAC TTGATTCAGT CACACCACCA GAAAGACCAG TTGATAAACC 500 

ACTTAGACTT CCACTTCAAG ATGTTTAYAA GATTTCAGGT ATTGGAACTG 550 

55 TACCATGTGG AAGAGTTGAA ACTGGAGTTC TTAGACCAGG AACTATTGTT 600 

CAATTTGCAC CATCAGGAGT TTCATCTGAA TGTAAATCAG TTGAAATGCA 650 

TCACACAGCA CTTGCTCAAG CTATTCCAGG TGATAATGTT GGATTCAATG 700 

TTAGAAAYTT AACAGTTAAA GATATTAAGA GAGGAAATGT AGCATCAGAT 750 

GCTAAGAATC AACCAGCTGT TGGATGTGAA GATTTCACTG CTCAAGTCAT 8 00 

60 TGTCATGAAC CATCCAGGAC AAATTAGAAA GGGATATACA CCAGTTCTTG 850 
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ATTGCCATAC ATCACACATT 
ATTGATAGAA GAACAGGTAA 
TAAGAATGGA GATTCAGCAC 
GTGTTGAAGA ATTTGCTAAA 
5 GA 



GCATGTAAAT TCGAAGAATT 
ATCCATGGAA GGAGGAGAAC 
TTGTTAAGAT TGTTCCAACT 
TTCCCACCAT TGGGAAGATT 



PCT/CA00/01150 

ATTAJ^GCAAG 90jy* 
CAGA^PATAT 930 
AAACCACTTT 1000 
TGCTGTTAGA 1050 
1052 



2) INFORMATION FOR SEQ ID NO: 513 



10 



( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1082 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Giardia lamblia 

(B) STRAIN: Faubert-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513 

25 GACGAGCGCG AGCGCGGGAT CACGATCAAC ATCGCGCTCT GGAAGTTCGA 50 

GACGAAGAAG TACATCGTCA CGATCATCGA CGCCCCGGGC CACCGCGACT 100 

TCATCAAGAA CATGATCACG GGGACGTCCC AGGCCGACGT CGCGATCCTC 150 

GTCGTCGCGG CGGGCCAGGG CGAGTTCGAG GCCGGGATCT CGAAGGACGG 2 00 

CCAGACGCGC GAGCACGCGA CCCTTGCGAA CACGCTCGGG ATCAAGACGA 2 50 

3 0 TGATCATCTG CGTCAACAAG ATGGACGACG GCCAGGTCAA GTACTCGAAG 3 00 

GAGCGCTACG ACGAGATCAA GGGCGAGATG ATGAAGCAGC TCAAGAACAT 3 50 

CGGCTGGAAG AAGGCCGAGG AGTTCGACTA CATCCCGACG TCCGGCTGGA 400 

C CGGGGACAA CATCATGGAG AAGTCCGACA AGATGCCCTG GTACGAGGGC 450 

CCGTGCCTGA TCGACGCGAT CGACGGGCTC AAGGCCCCGA AGCGCCCGAC 500 

3 5 CGACAAGCCC CTCCGCCTCC CGATCCAGGA CGTCTACAAG ATCTCGGGCG 550 

TCGGGACCGT CCCCGCGGGC CGCGTCGAGA CGGGCGAGCT CGCGCCCGGG 60 0 

ATGAAGGTCG TCTTCGCCCC GACGTCCCAG GTCTCGGAGG TCAAGTCCGT 650 

CGAGATGCAC CACGAGGAGC TCAAGAAGGC CGGGCCCGGG GACAACGTCG 70 0 

GCTTCAACGT CCGCGGGCTC GCCGTCAAGG ACCTCAAGAA GGGCTACGTC 750 

4 0 GTCGGGGACG TGACGAACGA CCCGCCCGTC GGCTGCAAGA GCTTCACCGC 800 

CCAGGTCATC GTCATGAACC ACCCGAAGAA GATCCAGCCC GGCTACACGC 850 

CCGTCATCGA CTGCCACACC GCGCACATCG CGTGCCAGTT CCAGCTCTTC 90 0 

CTCCAGAAGC TCGACAAGCG CACGCTCAAG CCCGAGATGG AGAACCCGCC 950 

CGACGCAGGC CGCGGCGATT GCATCATCGT CAAGATGGTC CCCCAGAAGC 1000 

4 5 CCCTGTGCTG CGAGACGTTC AACGACTACG CGCCCCTCGG CCGCTTCGCC 1050 

GTCCGCGACA TGCGCCAAAC CGTTGCCGTC GG 1082 



50 2) INFORMATION FOR SEQ ID NO: 514 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(B) 



PCT/CA00/01150 



ORGANISM: Leishmania tropica 
STRAIN: ATCC 3 0816 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514 



ACGCGTGGGT 
ATCGACATTG 
CATCGATGCG 
CGTCGCAGGC 

10 TTCGAGGCTG 
TGCCTTCACT 
ACGACAAGAC 
GAGGTGGGCG 
CTTCATCCCG 

15 ACAACATGCC 
CTGGAGCCGC 
GGACGTGTAC 
AGACCGGCAT 
GTGACGACTG 

20 GGCGCAGCCC 
AGGACATCCG 
AAGGAGGCGG 
CCAGATCAGC 
TTGCGTGCCG 

2 5 AAGGAGCTGG 
CGTGAAGATG 
ACGCGCCGCT 



GCTCGACAAG 
CGCTGTGGAA 
CCCGGCCACC 
GGACGCCGCC 
GCATCTCGAA 
CTTGGCGTGA 
GGTGACGTAC 
CGTACCTGAA 
ATCTCGGGCT 
GTGGTACAAG 
CGGTGCGCCC 
AAGATCGGCG 
CATGAAGCCG 
AGGTGAAGTC 
GGCGACAACG 
CCGTGGTAAC 
CCGACTTCAC 
AACGGCTACG 
CTTCGCGGAA 
AGAAGAACCC 
GTGCCGCAGA 
GGGCCGCTTT 



CTGAAGGCGG 
GTTCGAGTCG 
GCGACTTCAT 
ATCCTGATGA 
GGACGGCCAG 
AGCAGATGGT 
GCGCAGTCGC 
GCGCGTGGGC 
GGCAGGGCGA 
GGTCCCACGC 
GGTGGACAAG 
GTATCGGGAC 
GGCGACGTGG 
GATCGAGATG 
TCGGCTTCAA 
GTGTGCGGCA 
GGCGCAGGTG 
CGCCGGTGCT 
ATCGAGTCCA 
CAAGGCGATC 
AGCCGATGTG 
GCCGTGCGCG 



AGCGCGAGCG 
CCCAAGTCCG 
CAAGAACATG 
TCGACTCGAC 
ACCCGCGAGC 
GGTGTGCTGC 
GCTACGATGA 
TACAACCCGG 
CAACATGATC 
TGCTGGACGC 
CCGCTGCGCC 
GGTGCCCGTG 
TGACGTTCGC 
CACCACGAGC 
CGTGAAGAAC 
ACTCGAAGAA 
ATCGTGCTGA 
GGACTGCCAC 
AGATCGACCG 
AAGTCTGGCG 
CGTGGAGGTG 
ACATGCGCCA 



CGGCATCACG 50 

TGTTCACGAT 100 

ATCACGGGCA 150 

GCATGGTGGC 2 00 

ACGCGCTGCT 250 

AACAAGATGG 3 00 

GATCAGCAAG 350 

AGAAGGTGCG 4 00 

GAGAAGTCGG 4 50 

GCTCGACATG 500 

TGCCCCTGCA 550 

GGGCGCGTGG 600 

GCCCGCCAAC 650 

AGCTGGCGGA 700 

GTGTCGGTGA 750 

CGACCCGCCG 800 

ACCACCCCGG 850 

ACGAGCCACA 900 

CCGCTCCGGC 950 

ATGCCGCGAT 10 00 

TTCAACGACT 1050 

AACCGTTG 1098 



30 
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40 
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50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 515 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1104 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515 



TACGCGTGGG 
GATCGACATT 
TCATCGATGC 
ACGTCGCAGG 
CTTCGAGGCT 
TTGCCTTCAC 
GACGACAAGA 
GGAGGTGGGC 
GCTTCATCCC 
GACAACATGC 
GCTGGAGCCG 
AGGACGTGTA 
GAGACCGGCA 
CGTGACGACT 



TGCTCGACAA 
GCGCTGTGGA 
GCCCGGCCAC 
CGGACGCCGC 
GGCATCTCGA 
TCTTGGCGTG 
CGGTGACGTA 
GCGTACCTGA 
GATCTCGGGC 
CGTGGTACAA 
CCGGTGCGCC 
CAAGATCGGC 
TCATGAAGCC 
GAGGTGAAGT 



GCTGAAGGCG 
AGTTCGAGTC 
CGCGACTTCA 
CATCCTGATG 
AGGACGGCCA 
AAGCAGATGG 
CGCGCAGTCG 
AGCGCGTGGG 
TGGCAGGGCG 
GGGTCCCACG 
CGGTGGACAA 
GGTATCGGGA 
GGGCGACGTG 
CGATCGAGAT 
31 



GAGCGCGAGC 
GCCCAAGTCC 
TCAAGAACAT 
ATCGACTCGA 
GACCCGCGAG 
TGGTGTGCTG 
CGCTACGAGG 
CTACAACCCG 
ACAACATGAT 
CTGCTGGACG 
GCCGCTGCGC 
CGGTGCCCGT 
GTGACGTTCG 
GCAC CACGAG 



GCGGCATCAC 50 

GTGTTCACGA 100 

GATCACGGGC 150 

CGCATGGTGG 200 

CACGCGCTGC 250 

CAACAAGATG 3 00 

AGATCAGCAA 3 50 

GAGAAGGTGC 4 00 

CGAGAAGTCG 4 50 

CGCTCGACAT 500 

CTGCCCCTGC 550 

GGGCCGCGTG 600 

CGCCCGCCAA 650 

CAGCTGGCGG 700 
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AGGCGCAGCC CGGCGACAAC GTCGGCTTCA 

AAGGACATCC GCCGTGGCAA CGTGTGCGGC 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC 

5 ATTGCGTGCC GCTTCGCGGA AATCGAGTCC 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC 
CGTC 



PCT/CA00/01150 

fccBttSiKiSK' COTGl^GSTG -P3d 

AACTCGAAGA ACGACCCGCC 8</0 

GATCGTGCTG AACCACCCCG 85 0 

TGGACTGCCA CACGAGCCAC 900 

AAGATCGACC GCCGCTCCGG 950 

CAAGTCTGGC GATGCCGCGA 100 0 

GCGTGGAGGT GTTCAACGAC 1050 

GACATGCGCC AAACCGTTGC 1100 

1104 



2) INFORMATION FOR SEQ ID NO: 516 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 
2 5 (B) STRAIN: ATCC 3 0815 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516 



TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

3 0 GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 10 0 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 
ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 2 00 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 2 50 

TTGCCTTCAC KCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

35 GACGACAAGA CGGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 3 50 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 400 
GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAAGTCG 4 50 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 
GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

4 0 AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGGCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 
CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 
AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 
AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

45 GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 
GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 
ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 10 00 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

50 TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100 

CGTCGG H06 



55 2) INFORMATION FOR SEQ ID NO: 517 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 10 99 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 
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15 



20 
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WO 01/23604 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517 



PCT/CA00/01150 



TACGCGTGGG 
GATCGACATT 
TCATCGATGC 
ACGTCGCAGG 
CTTCGAGGCT 
TTGCCTTCAC 
GACGACAAGA 
GGAGGTGGGC 
GCTTCATCCC 
GACAACATGC 
GCTGGAGCCG 
AGGACGTGTA 
GAGACCGGCA 
CGTGACGACT 
AGGCGCAGCC 
AAGGACATCC 
GAAGGAGGCG 
GCCAGATCAG 
ATTGCGTGCC 
CAAGGAGCTG 
TCGTGAAGAT 
TACGCGCCGC 



TGCTCGACAA 
GCGCTGTGGA 
GCCCGGCCAC 
CGGACGCCGC 
GGCATCTCGA 
GCTTGGCGTG 
CCGTGACGTA 
GCGTACCTGA 
GATCTCGGGC 
CGTGGTACAA 
CCGGTGCGCC 
CAAGATCGGC 
TCATGAAGCC 
GAGGTGAAGT 
CGGCGACAAC 
GCCGTGGCAA 
GCCGACTTCA 
CAACGGCTAC 
GCTTCGCGGA 
GAGAAGAACC 
GGTGCCGCAG 
TGGGCCGCTT 



GCTGAAGGCG 
AGTTCGAGTC 
CGCGACTTCA 
CATCCTGATG 
AGGACGGCCA 
AAGCAGATGG 
CGCGCAGTCG 
AGCGCGTGGG 
TGGCAGGGCG 
GGGTCCCACG 
CGGTGGACAA 
GGTATCGGGA 
GGGCGACGTG 
CGATCGAGAT 
GTCGGCTTCA 
CGTGTGCGGC 
CGGCGCAGGT 
GCGCCGGTGC 
AATCGAGTCC 
CCAAGGCGAT 
AAGCCGATGT 
TGCCGTGCGC 



GAGCGCGAGC 
GCCCAAGTCC 
TCAAGAACAT 
ATCGACTCGA 
GACCCGCGAG 
TGGTGTGCTG 
CGCTACGATG 
CTACAACCCG 
ACAACATGAT 
CTGCTGGACG 
GCCGCTGCGC 
CTGTGCCCGT 
GTGACGTTCG 
GCACCACGAG 
ACGTGAAGAA 
AACTCGAAGA 
GATCGTGCTG 
TGGACTGCCA 
AAGATCGACC 
CAAGTCTGGC 
GCGTGGAGGT 
GACATGCGCC 



GCGGCATCAC 50 

GTGTTCACGA 100 

GATCACGGGC 150 

CGCATGGTGG 2 00 

CACGCGCTGC 2 50 

CAACAAGATG 3 00 

AGATCAGCAA 3 50 

GAGAAGGTGC 4 00 

CGAGAGGTCG 450 

CGCTCGACAT 500 

CTGCCCCTGC 550 

GGGCCGCGTG 600 

CGCCCGCCAA 650 

CAGCTGGCGG 70 0 

CGTGTCGGTG 750 

ACGACCCGCC 800 

AACCACCCCG 850 

CACGAGCCAC 900 

GCCGCTCCGG 950 

GATGCCGCGA 1000 

GTTCAACGAC 1050 

AAACCGTTG 1099 



35 



45 



2) INFORMATION FOR SEQ ID NO: 518 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1098 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCCAAGTCC GTGTTCACGA 100 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

55 ACGTCGCAGG CGGACGCCGC CATCCTGATG ATCGACTCGA CGCATGGTGG 200 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 250 

TTGCCTTCAC GCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

GACGACAAGA CCGTGACGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGGGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

60 GCTTCATCCC GATCTCGGGC TGGCAGGGCG ACAACATGAT CGAGAGGTCG 450 
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GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCT^ACAT SOJb 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 55 0 

AGGACGTGTA CAAGATCGGC GGTATCGGGA CTGTGCCCGT GGGCCGCGTG 600 

GAGACCGGCA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

5 CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGCAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

10 ATTGCGTGCC GCTTCGCGGA AATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GATGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTT 1098 



15 



30 



2) INFORMATION FOR SEQ ID NO: 519 



(i) SEQUENCE CHARACTERISTICS : 
2 0 (A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettii 

(B) STRAIN: ATCC 50120 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519 



CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

GCTGTGGAAG TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGATGCGC 100 

35 CCGGCCACCG CGACTTCATC AAGAACATGA TCACCGGCAC GTCGCAGGCC 150 

GACGCCGCCA TCCTGATGAT CGACTCGACC CAGGGCGGCT TCGAGGCTGG 2 00 

CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTC GCCTTCACGC 2 50 

TTGGTGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 300 

GTGCAGTACT CGCAGGCGCG CTACGAGGAG ATCAGCAAGG AGGTGGGCGC 350 

4 0 GTACCTGAAG CGCGTCGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 40 0 

TCTCGGGCTG GCAGGGCGAC AACATGATCG ACAAGTCGGA CAGCATGCCG 4 50 

TGGTACAAGG GGCCCACGCT GCTGGACGCG CTCGACATGC TGGAGGCGCC 50 0 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 550 

AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 600 

4 5 ATGAAGCCTG GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA 650 

GGTGAAGTCG ATCGAGATGC ACCACGAGCA GCTGGCGGAG GCGGTGCCCG 700 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGTCGGTGAA GGACATCCGC 750 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 8 00 

CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCTGGC CAGATCAGCA 850 

50 ACGGCTACGC GCCGGTGCTG GACTGCCACA CGAGCCACAT CGCGTGCCGC 900 

TTCGCGGATA TCGAGTCCAA GATCGACCGC CGCTCTGGCA AGGAGCTGGA 950 

GAAGAACCCC AAGGCGATCA AGTCCGGCGA TGCGGCCATC GTGAAGATGG 1000 

TGCCGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CCCGCCGCTG 1050 

GGGCGCTTCG CTGTGCGCGA C 1071 

55 

2 ) INFORMATION FOR SEQ ID NO: 52 0 
60 (i) SEQUENCE CHARACTERISTICS: 

320 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmanial gerbilli 
10 (B) STRAIN: ATCC 50121 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520 

CTCGACAAGC TGAAGGCGGA GCGCGAGCGC GGCATCACGA TCGACATTGC 50 

15 GCTGTGGAAG TTCGAGTCGC CCAAGTCCGT GTTCACGATC ATCGATGCGC 100 

CCGGCCACCG CGACTTCATC AAGAACATGA TCACGGGCAC GTCGCAGGCG 150 

GACGCCGCCA TCCTGATGAT CGACTCGACG CATGGTGGCT TCGAGGCTGG 200 

CATCTCGAAG GACGGCCAGA CCCGCGAGCA CGCGCTGCTT GCCTTCACTC 2 50 

TTGGCGTGAA GCAGATGGTG GTGTGCTGCA ACAAGATGGA CGACAAGACG 3 00 

20 GTGACGTACG CGCAGTCGCG CTACGATGAG ATCAGCAAGG AGGTGGGCGC 3 50 

GTACCTGAAG CGCGTGGGCT ACAACCCGGA GAAGGTGCGC TTCATCCCGA 4 00 

TCTCGGGCTG GCAGGGCGAC AACATGATCG AGAAGTCGGA CAACATGCCG 4 50 

TGGTACAAGG GTCCCACGCT GCTGGACGCG CTCGACATGC TGGAGCCGCC 500 

GGTGCGCCCG GTGGACAAGC CGCTGCGCCT GCCCCTGCAG GACGTGTACA 550 

25 AGATCGGCGG TATCGGGACG GTGCCCGTGG GCCGCGTGGA GACCGGCATC 6 00 

ATGAAGCCGG GCGACGTGGT GACGTTCGCG CCCGCCAACG TGACGACTGA 650 

GGTGAAGTCG ATCGAGATGC ACCACGAGCA GCTGGCGGAG GCGCAGCCCG 700 

GCGACAACGT CGGCTTCAAC GTGAAGAACG TGTCGGTGAA GGACATCCGC 750 

CGTGGTAACG TGTGCGGCAA CTCGAAGAAC GACCCGCCGA AGGAGGCGGC 8 00 

3 0 CGACTTCACG GCGCAGGTGA TCGTGCTGAA CCACCCCGGC CAGATCAGCA 8 50 

ACGGCTATGC GCCGGTGCTG GACTGCCACA CGAGCCACAT TGCGTGCCGC 900 

TTCGCGGAAA TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTGGA 950 

GAAGAACCCC AAGGCGATCA AGTCTGGCGA TGCCGCGATC GTGAAGATGG 10 00 

TGCCGCAGAA GCCGATGTGC GTGGAGGTGT TCAACGACTA CGCGCCGCTG 1050 

35 GGCCGCTTTG CCGTGCGCGA C 1071 



40 



2 ) INFORMATION FOR SEQ ID NO: 521 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1114 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521 

55 TCGTTCAAGT ACGCGTGGGT GCTCGACAAG CTGAAGGCGG AGCGCGAGCG 50 

CGGTATCACG ATCGACATTG CGCTGTGGAA GTTCGAGTCG CCCAAGTCGG 100 

TGTTCACGAT CATCGATGCG CCCGGCCACC GCGACTTCAT CAAGAACATG 150 

ATCACCGGCA CGTCGCAGGC GGATGCTGCC ATTCTGATGA TCGATTCGAC 2 00 

GCAGGGTGGC TTCGAGGCTG GCATCTCGAA GGACGGCCAG ACGCGCGAGC 2 50 

60 ACGCGCTGCT GGCCTTCACG CTGGGCGTGA AGCAGATGGT TGTGTGCTGC 3 00 
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AACAAGATGG ACGACAAGAC GGTGCAGTAC 
GATCAGCAAG GAGGTGGGCG CGTACCTGAA 
AGAAGGTGCG CTTCATCCCG ATCTCGGGCT 
GAGAAGTCCG ACAACATGTC GTGGTACAAG 
GCTGGACATG CTGGAGGCGC CGGTGCGCCC 
TGCCCCTGCA GGACGTGTAC AAGATCGGCG 
GGCCGTGTGG AGACCGGCAT CATGAAGCCC 
GCCCGCCAAC GTGACGACGG AGGTGAAGTC 
AGCTGCAGGA GGCTGTGCCC GGCGACAACG 
GTGTCGGTGA AGGACATCCG CCGTGGTAAC 
CGACCCGCCG AAGGAGGCGG CTGACTTCAC 
ACCACCCCGG CCAGATCAGC AACGGCTACG 
ACCAGCCACA TCGCGTGCCG CTTCGCGGAC 
CCGCTCCGGC AAGGAGCTGG AGAAGAACCC 
ACGCCGCCAT CGTGAAGATG GTGCCGCAGA 
TTCAACGACT ACCCGCCGCT GGGCCGCTTT 
AACCGTTGCC GTCG 



PCT/CA00/01150 

GCGCAGGCGC GCTACGAGGA 3 5 0/ 

GCGCGTGGGC TACAACCCGG 4 00 

GGCAGGGCGA CAACATGATC 4 50 

GGTCCCACGC TGCTGGAGGC 50 0 

GGTGGACAAG CCGCTGCGCC 55 0 

GCATTGGCAC GGTGCCGGTG 600 

GGCGACGTGG TGACGTTCGC 650 

GATCGAGATG CACCACGAGC 700 

TCGGCTTCAA CGTGAAGAAC 750 

GTGTGTGGCA ACTCGAAGAA 800 

GGCGCAGGTG ATCGTGCTGA 850 

CGCCGGTGCT GGACTGCCAC 900 

ATCGAGTCGA AGATCGACCG 950 

CAAGTCCATC AAGTCCGGCG 100 0 

AGCCGATGTG CGTGGAGGTG 1050 

GCGGTGCGCG ACATGCGCCA 110 0 

1114 



2) INFORMATION FOR SEQ ID NO: 522 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522 



TACGCGTGGG TGCTCGACAA GCTGAAGGCG 
GATCGACATT GCGCTGTGGA AGTTCGAGTC 
TCATCGATGC GCCCGGCCAC CGCGACTTCA 
ACGTCGCAGG CGGACGCCGC CATTCTGATG 
CTTCGAGGCT GGCATCTCGA AGGACGGCCA 
TTGCCTTCAC TCTTGGCGTG AAGCAGATGG 
GACGACAAGA CGGTGACGTA CGCGCAGTCR 
GGAGGTGGGC GCGTACCTGA AGCGCGTGGG 
GCTTCATCCC GATCTCGGGC TGGCAGGGCG 
GACAACATGC CGTGGTACAA GGGTCCCACG 
GCTGGAGCCG CCGGTGCGCC CGGTGGACAA 
AGGACGTGTA CAAGATCGGC GGTATCGGGA 
GAGACCGGCA TCATGAAGCC GGGCGACGTG 
CGTGACGACT GAGGTGAAGT CGATCGAGAT 
AGGCGCAGCC CGGCGACAAC GTCGGCTTCA 
AAGGACATCC GCCGTGGTAA CGTGTGCGGC 
GAAGGAGGCG GCCGACTTCA CGGCGCAGGT 
GCCAGATCAG CAACGGCTAT GCGCCGGTGC 
ATTGCGTGCC GCTTCGCGGA AATCGAGTCC 
CAAGGAGCTG GAGAAGAACC CCAAGGCGAT 
TCGTGAAGAT GGTGCCGCAG AAGCCGATGT 
TACGCGCCGC TGGGCCGCTT TGCCGTGCGC 
CGTCGG 



GAGCGCGAGC GCGGCATCAC 50 

GCCCAAGTCC GTGTTCACGA 100 

TCAAGAACAT GATCACGGGC 150 

ATCGACTCGA CGCATGGCGG 2 00 

GACCCGCGAG CACGCGCTGC 2 50 

TGGTGTGCTG CAACAAGATG 3 00 

CGCTACGATG AGATCAGCAA 3 50 

CTACAACCCG GAGAAGGTGC 4 00 

ACAACATGAT CGAGAAGTCG 4 50 

CTGCTGGACG CGCTCGACAT 500 

GCCGCTGCGC CTGCCCCTGC 550 

CGGTGCCCGT GGGCCGCGTG 600 

GTGACGTTCG CGCCCGCCAA 650 

GCACCACGAG CAGC TGGCGG 70 0 

ACGTGAAGAA CGTGTCGGTG 750 

AACTCGAAGA ACGACCCGCC 800 

GATCGTGCTG AACCACCCCG 850 

TGGACTGCCA CACGAGCCAC 900 

AAGATCGACC GCCGCTCCGG 950 

CAAGTCTGGC GATGCCGCGA 10 00 

GCGTGGAGGT GTTCAACGAC 1050 

GACATGCGCC AAACCGTTGC 1100 

1106 
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2 ) INFORMATION FOR SEQ ID NO: 523 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 
15 (B) STRAIN: ATCC 50131 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523 

TACGCGTGGG TGCTCGACAA GCTGAAGGCG GAGCGCGAGC GCGGCATCAC 50 

2 0 GATCGACATT GCGCTGTGGA AGTTCGAGTC GCCGAAGTCC GTGTTCACGA 10 0 

TCATCGATGC GCCCGGCCAC CGCGACTTCA TCAAGAACAT GATCACGGGC 150 

ACGTCGCAGG CGGACGCGGC CATCCTGATG ATCGACTCGA CGCATGGTGG 2 00 

CTTCGAGGCT GGCATCTCGA AGGACGGCCA GACCCGCGAG CACGCGCTGC 2 50 

TTGCCTTCAC TCTTGGCGTG AAGCAGATGG TGGTGTGCTG CAACAAGATG 3 00 

25 GACGACAAGA CGGTGATGTA CGCGCAGTCG CGCTACGATG AGATCAGCAA 350 

GGAGGTGAGC GCGTACCTGA AGCGCGTGGG CTACAACCCG GAGAAGGTGC 4 00 

GCTTCATCCC GATCTCGGGG TGGCAGGGCG ACAACATGAT CGACAAGTCG 4 50 

GACAACATGC CGTGGTACAA GGGTCCCACG CTGCTGGACG CGCTCGACAT 500 

GCTGGAGCCG CCGGTGCGCC CGGTGGACAA GCCGCTGCGC CTGCCCCTGC 550 

3 0 AGGACGTGTA CAAGATCGGC GGTATCGGGA CGGTGCCCGT GGGCCGCGTG 600 

GAGACCGGGA TCATGAAGCC GGGCGACGTG GTGACGTTCG CGCCCGCCAA 650 

CGTGACGACT GAGGTGAAGT CGATCGAGAT GCACCACGAG CAGCTGGCGG 700 

AGGCGCAGCC CGGCGACAAC GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG 750 

AAGGACATCC GCCGTGGTAA CGTGTGCGGC AACTCGAAGA ACGACCCGCC 800 

3 5 GAAGGAGGCG GCCGACTTCA CGGCGCAGGT GATCGTGCTG AACCACCCCG 850 

GCCAGATCAG CAACGGCTAC GCGCCGGTGC TGGACTGCCA CACGAGCCAC 900 

ATCGCGTGCC GCTTCGCGGA GATCGAGTCC AAGATCGACC GCCGCTCCGG 950 

CAAGGAGCTG GAGAAGAACC CCAAGGCGAT CAAGTCTGGC GACGCCGCGA 1000 

TCGTGAAGAT GGTGCCGCAG AAGCCGATGT GCGTGGAGGT GTTCAACGAC 1050 

4 0 TACGCGCCGC TGGGCCGCTT TGCCGTGCGC GACATGCGCC AAACCGTTGC 1100 

CGTCG HQS 



45 2) INFORMATION FOR SEQ ID NO: 524 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 

(B) STRAIN: ATCC 50156 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524 
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TACGCGTGGG 
GATCGACATT 
TCATCGATGC 
ACGTCGCAGG 
5 CTTCGAGGCT 
TTGCCTTCAC 
GACGACAAGA 
GGAGGTGAGC 
GCTTCATCCC 

10 GACAACATGC 
GCTGGAGCCG 
AGGACGTGTA 
GAGACCGGGA 
CGTGACGACT 

15 AGGCGCAGCC 
AAGGACATCC 
GAAGGAGGCG 
GCCAGATCAG 
ATCGCGTGCC 

2 0 CAAGGAGCTG 
TCGTGAAGAT 
TACGCGCCGC 



TGCTCGACAA 
GCGCTGTGGA 
GCCCGGCCAC 
CGGACGCGGC 
GGCATCTCGA 
TCTTGGCGTG 
CGGTGATGTA 
GCGTACCTGA 
GATCTCGGGG 
CGTGGTACAA 
CCGGTGCGCC 
CAAGATCGGC 
TCATGAAGCC 
GAGGTGAAGT 
CGGCGACAAC 
GCCGTGGTAA 
GCCGACTTCA 
CAACGGCTAC 
GCTTCGCGGA 
GAGAAGAACC 
GGTGCCGCAG 
TGGGCCGCTT 



GCTGAAGGCG 
AGTTCGAGTC 
CGCGACTTCA 
CATCCTGATG 
AGGACGGCCA 
AAGCAGATGG 
CGCGCAGTCG 
AGCGCGTGGG 
TGGCAGGGCG 
GGGTCCCACG 
CGGTGGACAA 
GGTATCGGGA 
GGGCGACGTG 
CGATCGAGAT 
GTCGGCTTCA 
CGTGTGCGGC 
CGGCGCAGGT 
GCGCCGGTGC 
GATCGAGTCC 
CCAAGGCGAT 
AAGCCGATGT 
TGCCGTGCGC 



GAGCGCGAGC 
GCCGAAGTCC 
TCAAGAACAT 
ATCGACTCGA 
GACCCGCGAG 
TGGTGTGCTG 
CGCTACGATG 
CTACAACCCG 
ACAACATGAT 
CTGCTGGACG 
GCCGCTGCGC 
CGGTGCCCGT 
GTGACGTTCG 
GCACCACGAG 
ACGTGAAGAA 
AACTCGAAGA 
GATCGTGCTG 
TGGACTGCCA 
AAGATCGACC 
CAAGTCTGGC 
GCGTGGAGGT 
GACATGCGCC 



PCT/CA00/01150 

GCGGcJ^TCAC 50 

GTGTTCACGA 100 

GATCACGGGC 150 

CGCATGGTGG 200 

CACGCGCTGC 250 

CAACAAGATG 3 00 

AGATCAGCAA 350 

GAGAAGGTGC 4 00 

CGACAAGTCG 4 50 

CGCTCGACAT 500 

CTGCCCCTGC 550 

GGGCCGCGTG 600 

CGCCCGCCAA 650 

CAGCTGGCGG 700 

CGTGTCGGTG 750 

ACGACCCGCC 800 

AACCACCCCG 850 

CACGAGCCAC 900 

GCCGCTCCGG 950 

GACGCCGCGA 1000 

GTTCAACGAC 1050 

AAACCGTT 10 98 



25 
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2) INFORMATION FOR SEQ ID NO: 525 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1081 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: II WT 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525 

GCTGAAGGCG GAGCGCGAGC GCGGCATCAC GATCGACATT GCGCTGTGGA 50 

AGTTCGAGTC GCCCAAGTCG GTGTTCACGA TCATCGATGC GCCCGGCCAC 100 

CGCGACTTCA TCAAGAACAT GATCACGGGC ACGTCGCAGG CGGACGCGGC 150 

45 CATCCTGATG ATCGACTCGA CGCACGGTGG GTTCGAGGCT GGCATCTCGA 200 

AGGACGGGCA GACGCGCGAG CACGCGCTGC TTGCCTTCAC TCTTGGCGTG 250 

AAGCAGATGG TTGTGTGCTG CAACAAGATG GACGACAAGA CGGTGATGTA 3 00 

CGCGCAGTCG CGCTACGATG AGATCAGCAA GGAGGTGGGC GCGTACCTGA 3 50 

AGCGCGTGGG CTACAACCCG GAGAAGGTGC GCTTCATCCC GATCTCGGGC 4 00 

50 TGGCAGGGCG ACAACATGAT CGAGAAGTCG GACAACATGC CGTGGTACAA 450 

GGGTCCCACG CTGCTGGACG CGCTCGACAT GCTGGAGGCG CCGGTGCGCC 50 0 

CGGTGGACAA GCCGCTGCGC CTGCCCCTGC AGGACGTGTA CAAGATCGGC 55 0 

GGCATCGGCA CGGTGCCCGT GGGCCGCGTG GAGACCGGCA TCATGAAGCC 60 0 

GGGCGACGTG GTGACGTTCG CGCCCGCGAA CGTGACGACG GAGGTGAAGT 650 

55 CGATCGAGAT GCACCACGAG CAGCTGGCGG AGGCGCAGCC CGGCGACAAC 700 

GTCGGCTTCA ACGTGAAGAA CGTGTCGGTG AAGGACATCC GCCGTGGGAA 750 

CGTGTGCGGT AACTCGAAGA ACGACCCGCC GAAGGAGGCC GCCGACTTCA 800 

CGGCGCAGGT GATCGTGCTG AACCACCCCG GCCAGATCAG CAACGGCTAC 850 

GCGCCGGTGC TGGACTGCCA CACGAGCCAC ATCGCGTGCC GGTTCGCGGA 900 

60 CATCGAGTCC AAGATTGACC GCCGCTCCGG CAAGGAGCTG GAGAAGAACC 950 
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CCAAGGCGAT CAAGTCCGGC GATGCCGCGA TCGTGAAGAT GGTGdCGCAG lOOOl 

AAGCCGATGT GCGTGGAGAT GTTCAACGAC TACGCGCCGC TTGGCCGCTT 10^0 

TGCTGTGCGC GACATGCGCC AAACCGTTGC C 1081 



2) INFORMATION FOR SEQ ID NO: 526 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 1102 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 5012 9 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526 



AAATACGCGT GGGTGCTCGA CAAGCTGAAG GCGGAGCGCG AGCGCGGCAT 50 
CACGATCGAC ATTGCGCTGT GGAAGTTCGA GTCGCCCAAG TCCGTGTTCA 100 
25 CGATCATCGA TGCGCCCGGC CACCGCGACT TCATCAAGAA CATGATCACG 150 
GGCACGTCGC AGGCGGACGC CGCCATCCTG ATGATCGACT CGACGCATGG 2 00 

TGGCTTCGAG GCTGGCATCT CGAAGGACGG CCAGACCCGC GAGCACGCGC 2 50 

TGCTTGCCTT CACKCTTGGC GTGAAGCAGA TGGTGGTGTG CTGCAACAAG 300 
ATGGACGACA AGACGGTGAC GTACGCGCAG TCGCGCTACG ATGAGATCAG 3 50 

3 0 CAAGGAGGTG GGCGCGTACC TGAAGCGCGT GGGCTACAAC CCGGAGAAGG 4 00 

TGCGCTTCAT CCCGATCTCG GGCTGGCAGG GCGACAACAT GATCGAGAAG 450 
TCGGACAACA TGCCGTGGTA CAAGGGTCCC ACGCTGCTGG ACGCGCTCGA 500 
CATGCTGGAG CCGCCGGTGC GCCCGGTGGA CAAGCCGCTG CGCCTGCCCC 550 
TGCAGGACGT GTACAAGATC GGCGGTATCG GGACGGTGCC CGTGGGGCGC 600 
35 GTGGAGACCG GCATCATGAA GCCGGGCGAC GTGGTGACGT TCGCGCCCGC 650 
CAACGTGACG ACTGAGGTGA AGTCGATCGA GATGCACCAC GAGCAGCTGG 700 
CGGAGGCGCA GCCCGGCGAC AACGTCGGCT TCAACGTGAA GAACGTGTCG 750 
GTGAAGGACA TCCGCCGTGG TAACGTGTGC GGCAACTCGA AGAACGACCC 8 00 

GCCGAAGGAG GCGGCCGACT TCACGGCGGA GGTGATCGTG CTGAACCACC 850 

4 0 CCGGCCAGAT CAGCAACGGC TACGCGCCGG TGCTGGACTG CCACACGAGC 90 0 

CACATTGCGT GCCGCTTCGC GGAAATCGAG TCCAAGATCG ACCGCCGCTC 950 

CGGCAAGGAG CTGGAGAAGA ACCCCAAGGC GATCAAGTCT GGCGATGCCG 1000 

CGATCGTGAA GATGGTGCCG CAGAAGCCGA TGTGCGTGGA GGTGTTCAAC 1050 

GACTACGCGC CGCTGGGCCG CTTTGCCGTG CGCGACATGC GCCAAACCGT 1100 

45 TG 1102 



50 



2) INFORMATION FOR SEQ ID NO: 52 7 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 110 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Neospora caninum 

325 
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WO 01/23604 

(B) STRAIN: Suarez-4 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527 



5 GGACRAACTT 
TGTGGAAATT 
GGACACAGAG 
CTGTGCTGTC 
TCTCCAAGAA 

10 GGTGTGAAAC 
ACCCTATAGC 
AYNTTAAAAA 
TNTGGCTGGA 
GTTCAAGGGA 

15 CCCTGCTTGA 
AAACCCTTGC 
TACTGTCCCT 
TGGTCACCTT 
ATGCACCATG 

20 CAATGTCAAG 
GTGACAGCAA 
GTGATTATTT 
GCTGGATTGT 
AGAAGATTGA 

25 TTGAAATCTG 
GTGTGTCGAG 
GTGAC 



AAAGCTGAAC 
TGAGACCAGC 
ACTTCATCAA 
CTGATTGTTG 
CGGGCAGACC 
AACTAATTGT 
CARAAGAGAT 
AATTGGYTAC 
ATGGTGACAA 
TGGAAAGTCM 
AGCTYTGGAT 
GTTTGCCTYT 
GTGGGTCGTG 
TGCTCCAGTC 
AAGCATTGAG 
AACGTGTCTG 
AAATGATCCA 
TGAACCATCC 
CACACAGCTC 
TCGTCGTTCT 
GTGACGCTGC 
AGCTTCTCTG 



GTGAGCGTGG 
AAGTACTATG 
AAACATGATT 
CTGCTGGTGT 
CGTGAGCATG 
TGGCGTTAAC 
ACGARGAAAT 
AACCCCGACA 
CATGCTGGAN 
CCCGTAAGGA 
TGCATTYTGC 
CCAGGATGTC 
TGGAGACTGG 
AATGTAACAA 
TGAAGCCCTT 
TCAAAGATGT 
CCCATGGAAG 
AGGCCAAATC 
ACATTGCTTG 
GGGAAAAAGC 
CATCGTTGAT 
ATTATCCTCC 



TATCACCATT 
TTACCATCAT 
ACAGGCACAT 
TGGTGAATTT 
CCCTTNTGGC 
AAAAKGGATT 
TGTTAAGGAA 
CAGTANCATT 
CCAAGTGCTA 
CGGCAATGCC 
CACCAAYTTG 
TATAAAATTG 
TGTTCTCAAA 
CTGAAGTGAA 
CCTGGGGACA 
CCGTCGTGGC 
CTGCTGGCTT 
AGTGCTGGAT 
CAAGTTTGCT 
TGGAAGATGG 
ATGGTTC CTG 
CCTGGGCCGT 



GATATCTCCC 50 

TGATGCCCCA 100 

CCCAGGCTGA 150 

GAAGCCGGTA 2 00 

TTACACCCTG 250 

CCACTGAGCC 3 00 

GTCAGCMCCT 3 50 

TGKGCCAATT 4 00 

ATATGCCATG 4 50 

AGKGGAACCM 5 00 

CCCAACTGAC 550 

GKGGTATTGG 600 

CCTGGCATGG 650 

GTCTGTAGAA 700 

ATGTGGGCTT 750 

AATGTGGCTG 800 

CACAGCTCAG 850 

ATGCACCTGT 900 

GAGCTGAAGG 950 

CCCTAAATTC 1000 

GCAAGCCCAT 1050 

TTTGCTGTGC 1100 
1105 



30 



40 



2) INFORMATION FOR SEQ ID NO: 52 8 



( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 93 5 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichomonas vaginalis 

(B) STRAIN: ATCC 30001 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528 

GCACATCCCA GGCTGATGCT GCTATCCTTG TCATCGACTC CACACTCGGT 50 

GGYTTCGAAG CCGGTATCGC TGAACAAGGC CAGACACGTG AACACGCTCT 100 

TCTTGCCTTC ACACTCGGCA TCAAGCAGGT CATTGTCGCC GTCAACAAGA 150 

50 TGGATGACAA GACAGTCAAC TACAACAAGG CYCGTTTCGA CGAAATCACA 200 

GCCGAAATGA CACGCATCCT TACAGGCATC GGCTACAAGC CAGAAATGTT 250 

CCGCTTCGTC CCAATCTCCG GCTGGGCTGG CGACAACATG ACAGAGAAGT 300 

CTCCAAACAT GCCATGGTAC AATGGCCCAT ACCTTCTTGA AGCCCTCGAT 350 

TCCCTTCAGC CACCAAAGCG CCCATTCGAC AAGCCACTCC GTCTTCCACT 4 00 

55 CCAGGATGTC TACAAGATCA ACGGTATCGG TACAGTTCCA GTCGGCCGTG 450 

TCGAATCCGG CACAATGAAG CCAGGCATGA TCGTTAACTT CGCCCCATCC 500 

ACAGTTACAG CTGAAGTTAA GTCCATCGAA ATGCACCACG AATCCCTTCC 550 

AGAGGCTCTT CCAGGTGACA ACATCGGCTT CAACGTCAAG AACGTTTCCA 600 

CAGCTGATGT CAAGCGTGGC TACGTCGTTG GTGATACAAA GCGTGACCCA 650 

60 CCAGTCGAAT GCGCTTCCTT CACAGCTCAR ATGATCATCT CCAACCACCC 700 

326 
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AGGCAAGATC CACGCCGGCT 
ACATCGCCTG CAAGTTCGAC 
GGCAAGAAGG CTACAGAGAA 
TATCGTCGAG GTTGTCCCAT 
5 AGTACCCACC ACTCGGCCGT 



ACCAGCCAGT TTTCGACTGC 
AAGCTCATCC AGCGTATCGA 
CCCAGAATAC ATTCAGAAGG 
CCAAGCCACT CGTCGTCGAG 
TTCGCCATCC GTGAT 



PCT/CA00/01150 

CACAC^GCTC 750 J 

TCGTCGCCAC 80 0/ 

ATGATGCCGC 850 
TCCTTCCAGG 90 0 

935 



2) INFORMATION FOR SEQ ID NO: 529 



10 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1065 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Trypanosoma brucei subsp. brucei 

(B) STRAIN: EATR07 95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529 

2 5 AAGCTGAAGG CTGAGCGCGA ACGTGGTATC ACGATCGACA TTGCACTGTG 50 

GAAATTCGAG TCACCCAAGT CTGTCTTCAC TATTATTGAT GCTCCTGGGC 10 0 

ACCGTGACTT CATCAAGAAC ATGATCACCG GCACATCGCA AGCCGACGCA 150 

GCCATCCTCA TCATTGCCTC TGCGCAGGGT GAGTTCGAGG CTGGTATCTC 2 00 

CAAGGATGGA CAGACCCGCG AGCACGCGTT GCTGGCCTTC ACTTTGGGTG 2 50 

3 0 TGAAGCAGAT GGTTGTGTGC TGCAACAAGA TGGACGACAA GACTGTGAAC 3 00 

TACGGACAGG AGCGGTATGA CGAGATTGTG AAGGAGGTGT CTGCTTACAT 3 50 

CAAGAAGGTT GGGTACAACG TGGAGAAGGT GCGCTTCGTC CCCATCTCCG 4 00 

GATGGCAGGG CGACAACATG ATTGAGAAAT CCGAGAAGAT GCCATGGTAC 450 

AAGGGTCCAA CGCTCCTGGA GGCACTAGAC ATGCTGGAGC CACCAGTGCG 500 

35 TCCGAGCGAC AAGCCCCTGC GTCTGCCACT GCAGGACGTG TACAAGATCG 550 

GTGGTATTGG CACCGTGCCC GTTGGTCGTG TGGAGACCGG CGTGATGAAG 6 00 

CCTGGTGATG TGGTGACGTT TGCCCCCGCC AACGTGACGA CCGAGGTGAA 650 

ATCGATCGAG ATGCACCACG AGCAGCTCGC TGAGGCGACC CCCGGTGACA 700 

ACGTCGGCTT TAACGTGAAG AACGTTTCTG TAAAGGACAT CCGCCGTGGC 750 

4 0 AACGTCTGCG GTAACACCAA GAACGACCCC CCAAAGGAGG CCGCCGACTT 8 00 

CACGGCACAG GTGATCATCC TGAACCACCC CGGACAGATT GGAAACGGTT 850 

ATGCGCCCGT GCTGGACTGC CACACATCGC ACATTGCCTG CAAGTTCGCG 90 0 

GAGATCGAGT CGAAGATCGA CCGTCGCTCT GGCAAGGAGC TGGAGAAGGC 950 

TCCCAAGTCG ATCAAGTCTG GCGACGCCGC GATCGTGCGC ATGGTGCCGC 1000 

45 AGAAGCCTAT GTGCGTGGAG GTCTTCAACG ACTACGCGCC ACTCGGCCGC 1050 

TTTGCCGTGC GTGAC 1065 



50 2) INFORMATION FOR SEQ ID NO: 530 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 97 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

327 
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WO 01/23604 PCT/CA00/01 150 

(A) ORGANISM: Crithxaia rascxcuiana 

(B) STRAIN: ATCC 11745 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530 

5 

TTCGCGGAGG GCGTGCCGCC GGTGCTGACG GCGCTGGACG TGACGGAGGA 50 

CCTCGGCCGC GACGAGCCGC TGACGCTGGA GATTGTGCAG CACTTGGACG 100 

CGAACACCGG CCGCTGCATT GCCATGCAGA CGACGGATCT GCTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC GGGCGGCAAC ATCTCCGTGC CGGTTGGCCG 200 

10 CGAGACGCTG GGCCGCATCT TCAACGTGCT CGGCGACGCG ATCGACCAGC 250 

GCGGTGTGGT GGGCGAGAAG ATGCGCATGC CGATCCACGC CGAGGCGCCG 3 00 

AAGCTGGCGG ACCAGGCCGC GGAGGACGCG ATTCTGACGA CCGGCATCAA 3 50 

GGTGATCGAC CTGATTCTGC CGTACTGCAA GGGTGGCAAG ATCGGGCTGT 4 00 

TCGGCGGTGC TGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 450 

15 AACGTGGCCA AGGGCCACGG TGGTTTCTCC GTGTTCGCCG GCGTTGGCGA 500 

GCGCACCCGC GAGGGCACGG ATCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TCATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTGGTGTA CGGCCAGATG 600 

AACGAGCCCC CGGGTGCGCG TGCGCGTGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGTGACG TGGAGGGCCA GAACGTGCTG CTGTTCATCG 700 

2 0 ACAACATCTT CCGCTTCACC CAGGCCAACT CCGAGGTGTC CGCCCTGCTG 750 

GGCCGCATTC CCGCCGCCGT GGGCTACCAG CCGACGCTTG CCGAGGATCT 800 

TGGTATGCTG CAGGAGCGCA TTACGTCGAC GACGAAGGGC TCGATTACGT 850 

CTGTGCAGGC CGTGTACGTG CCGGCCGATG ATATCACGGA TCCGGCGCCG 900 

GCGACGACCT TCTCGCACCT GGATGCGACG ACGGTGCTGG ACCGCGCGGT 950 

25 TGCCGAGTCT GGCATCTACC CCGCCGTGAA CCCGCTGGAG TGCGCGTCGC 10 00 

GTATCATGGA CCCCGATGTG ATCGACGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

CGTGCTGGGT ATCGACGAGC TGAGCGAGGA GGACAAGCTT GTGGTGGACC 1150 

GCGCTCGCAA GGTGACGCGC TTCCTGTCGC AGCCGTTCCA GGTGGCCGAG 12 00 

30 GTGTTCACCG GCATGACGGG CCACTACGTG CAGCTGGAGG ACACAGTGGA 12 50 

GTCGTTCTCT GGCCTGCTGA TGGGCTCGTA CGACCAGATC CCGGAGA 12 97 



35 2) INFORMATION FOR SEQ ID NO: 531 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 98 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 3 0816 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531 



CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 10 0 

GCGAACACGG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 15 0 

GAAGTCGAAG GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 2 00 

55 GTGAGACGCT GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 3 00 

GAAGCTGGCG GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 400 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 450 

60 CAACGTCGCG AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 
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AGCGCACGCG CGAGGGCACG GACCTGTACC 
GTGATTGACC TGAAGGGCGA GTCGAAGTGY 
GAACGAGCCC CCGGGTGCGC GCGCGCGCGT 
TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC 
GACAACATCT TCCGCTTCAC GCAGGCGAAC 
GGGCCGCATT CCGGCCGCCG TGGGCTACCA 
TTGGTATGCT GCAGGAGCGC ATCACGTCGA 
TCCGTGCAGG CCGTGTACGT GCCAGCGGAT 
CGCGACGACG TTCTCGCACC TGGACGCGAC 
TGGCGGAGTC GGGCATCTAC CCTGCCGTGA 
CGTATCATGG ACCCTGATGT GATCGATGTG 
GGATATCGTG CAGATGCTGA CCAAGTACAA 
CGGTGCTTGG CATCGACGAG CTGAGCGAGG 
CGCGCGCGCA AGGTGACCCG GTTCCTGTCG 
GGTGTTCACG GGCATGACGG GCCACTACGT 
AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT 



PCT/CA00/01150 

TGGAGATGAT GCAGT*GAAG S^J& 

GTGCTTGTGT ATGGGCAGAT 60 0 

TGCGCAGTCT GCGCTGACGA 650 

AGAACGTGCT GCTGTTCATC 700 

TCCGAGGTGT CTGCGCTGCT 750 

GCCGACGCTT GCGGAGGATC 800 

CAACGAAGGG GTCGATCACG 850 

GATATCACGG ATCCCGCGCC 900 

GACTGTGCTG GACCGCGCGG 950 

ACCCGCTGGA GTGCGCGTCG 10 0 0 

GACCACTACA ACGTTGCGCA 1050 

GGAGCTGCAG GATATCATTG 1100 

AAGACAAGGT TGTTGTGGAC 1150 

CAGCCGTTCC AGGTTGCGGA 12 0 0 

GCAGCTGGTC GACACGGTGG 1250 

ACGACCAGAT CCCGGAGA 12 98 



2) INFORMATION FOR SEQ ID NO: 532 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 97 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532 

TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 5 0 

CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACTTGGACG 100 

CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GCTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 2 00 

TGAGACGCTG GGCCGCATCT TCAACGTTCT GGGCGACGCG ATCGACCAGC 2 50 

GCGGCCCCGT GGGCGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCA 3 00 

AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 350 

GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 4 00 

TCGGCGGTGC CGGTGTGGGC AAGACTGTGA TCATCATGGA GCTGATCAAC 450 

AACGTCGCGA AGGGCCACGG TGGTTTCTCC GTGTTTGCCG GCGTTGGCGA 500 

GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 600 

AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTC CGCGACG TGGAGGGCCA GAACGTGCTG CTGTTCATCG 700 

ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 750 

GGCCGCATTC CAGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 8 00 

TGGTATGCTG CAGGAGCGCA TCACGTCGAC AACGAAGGGG TCGATCACGT 850 

CCGTGCAGGC CGTGTACGTG CCAGCGGATG ATATCACGGA TCCCGCGCCC 90 0 

GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

GGCGGAGTCG GGCATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 10 00 

GTATCATGGA CCCCGACGTG ATCGATGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGC ATCGACGAGC TGAGCGAGGA AGACAAGGTT GTTGTGGACC 1150 

GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 12 00 

GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGTCG ACACGGTGGA 12 50 

329 
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WO 01/23604 PCT/CA00/01 150 

GTCGTTCTCT GGCCTGCTGA TGGGGTCGTA CGACCAGATC CCGGA^QA 12 97. 



5 2) INFORMATION FOR SEQ ID NO: 533 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1298 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533 



CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 2 00 

25 GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 2 50 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 300 

GAAGCTGGCG GACCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 4 00 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 4 50 

3 0 CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTATACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 7 00 

35 GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCCGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATATCACGG ATCCCGCGCC 900 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

4 0 TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 10 0 0 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

45 GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 12 98 



50 2 ) INFORMATION FOR SEQ ID NO: 534 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 98 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

330 
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WO 01/23604 



PCT/CAOO/01150 



(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534 

CTTCTCGGAG GGCGTGCCGC CCGTACTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAT 100 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 2 00 

GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGACGC GATCGACCAG 2 50 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 3 00 

AAAGCTGGCG GACCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA 350 

AGGTGATCGA CCTCATTCTG CCCTACTGCA AGGGCGGCAA GATCGGCCTG 4 00 

TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 4 50 

CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTATACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGCCGCATT CCGGCCGCCG TGGGCTAC C A GCCGACGCTT GCCGAGGATC 800 

TTGGTATGCT GCAGGAGCGC ATCACATCGA CGACGAAGGG GTCGATCACG 850 

TCCGTGCAGG CCGTGTACGT GCCGGCGGAT GATATCACTG ATCCCGCGCC 90 0 

CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 10 00 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 12 00 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGCC GACACGGTGG 12 50 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAGA 12 98 



2) INFORMATION FOR SEQ ID NO: 535 

( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 13 01 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 

(B) STRAIN: ATCC 50121 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535 

GCACTTCTCG GAGGGCGTGC CGCCCGTGCT GACGGCGCTG GATGTGACGG 50 

AGGACCTTGG CCGCGATGAG CCGCTGACGC TGGAGATCGT GCAGCACTTG 100 

GACGCGAACA CCGGCCGCTG CATTGCGATG CAGACGACGG ACCTGCTGAA 150 

GCTGAAGTCG AAGGTTGTGT CGACCGGTGG CAACATCTCT GTGCCGGTGG 200 

GCCGTGAGAC GCTGGGCCGC ATCTTCAACG TTCTGGGCGA TGCGATCGAC 250 

CAGCGCGGCC CCGTGGGCGA GAAGATGCGC ATGGCGATCC ACGCCGAGGC 3 00 

CCCGAAGCTG GCGGATCAGG CCGCGGAGGA CACGATCCTG ACGACCGGCA 350 

TCAAGGTGAT CGACCTGATT CTGCCCTACT GCAAGGGTGG CAAGATCGGY 400 

CTGTTCGGCG GTGCCGGTGT GGGCAAGACT GTGATCATCA TGGAGCTGAT 450 

CAACAACGTC GCGAAGGGCC ACGGTGGTTT CTCCGTGTTT GCCGGCGTTG 500 
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GCGAGCGCAC GCGCGAGGGC ACGGACCTGT 
AAGGTGATTG ACCTGAAGGG CGAGTCGAAG 
GATGAACGAG CCCCCGGGTG CGCGCGCGCG 
CGATGGCGGA GTACTTCCGC GACGTGGAGG 
5 ATCGACAACA TCTTCCGCTT CACGCAGGCG 
GCTGGGCCGC ATTCCGGCCG CCGTGGGCTA 
ATCTTGGTAT GCTGCAGGAG CGCATCACGT 
ACGTCCGTGC AGGCCGTGTA CGTGCCAGCG 
GCCCGCGACG ACGTTCTCGC ACCTTGACGC 

10 CGGTGGCGGA GTCGGGCATC TACCCTGCCG 
TCGCGTATCA TGGACCCCGA TGTGATCGAT 
GCAGGATATC GTGCAGATGC TGACCAAGTA 
TTGCGGTGCT TGGCATCGAC GAGCTGAGCG 
GACCGCGCGC GCAAGGTGAC CCGGTTCCTG 

15 GGAGGTGTTC ACGGGCATGA CGGGCCACTA 
TGGAGTCGTT CTCTGGCTTG CTGATGGGGT 
A 
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ACCTGGAGAT GATGCAGTCG " ' ST5T) 

TGCGTGCTTG TGTACGGGCA 6 0&> 

CGTTGCGCAG TCTGCGCTGA 650 

GCCAGAACGT GCTGCTGTTC 700 

AACTCCGAGG TGTCCGCGCT 75 0 

CCAGCCGACG CTTGCGGAGG 8 00 

CGACAACGAA GGGGTCGATC 850 

GATGATATCA CGGATCCCGC 900 

GACGACTGTG CTGGACCGCG 950 

TGAACCCGCT GGAGTGCGCG 10 00 

GTGGACCACT ACAACGTTGC 1050 

CAAGGAGCTG CAGGACATCA 1100 

AGGAAGACAA GGTTGTGGTG 1150 

TCGCAGCCGT TCCAGGTTGC 12 00 

CGTGCAGCTG GTCGACACGG 12 50 

CGTACGACCA GATCCCGGAG 13 0 0 

1301 



20 



2) INFORMATION FOR SEQ ID NO: 536 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 98 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536 

CTTCGCGGAG GGCGTGCCGC CGGTGCTGAC GTCGCTGGAT GTGACGGAGA 50 

ACCTCGGCCG CGATGAGCCG CTGACGCTGG AGATTGTGCA GCACTTGGAC 100 

GCGAACACCG GTCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

4 0 GAAGTCGAAG GTCGTGTCGA CCGGTGGCAA CATCTCTGTG CCTGTTGGCC 2 00 

GCGAGACGCT GGGTCGCATC TTCAACGTGC TTGGCGATGC GATTGACCAG 25 0 

CGCGGCCCTG TGGGTGAGAA GATGCGCATG GCGATCCACG CCGAGGCGCC 3 00 

GAAGCTGGCG GATCAGGCGG CAGAGGACAC GATCCTGACG ACCGGCATCA 3 50 

AGGTGATCGA TCTTATTCTG CCGTACTGCA AGGGTGGTAA GATCGGTCTG 4 00 

4 5 TTCGGTGGTG CCGGTGTAGG CAAGACTGTG ATTATTATGG AGCTGATCAA 450 

TAACGTGGCG AAGGGCCACG GTGGGTTTTC CGTGTTTGCT GGCGTGGGCG 500 

AGCGCACGCG CGAGGGCACT GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA ATCAAAGTGC GTGCTTGTGT ACGGACAGAT 60 0 

GAACGAGCCC CCGGGTGCGC GTGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

50 TGGCCGAGTA CTTCCGCGAT GTGGAGGGCC AGAACGTGCT GCTGTTCATT 70 0 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT 750 

GGGTCGCATT CCTGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 

TGGGCATGCT GCAGGAGCGC ATTACGTCGA CGACGAAGGG CTCGATTACG 850 

TCTGTGCAGG CCGTGTACGT GCCTGCGGAT GATATCACGG ACCCGGCGCC 900 

55. CGCGACGACG TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGTGCGG 950 

TGGCAGAGTC GGGCATTTAC CCTGCGGTGA ACCCGCTGGA GTGCGCGTCG 1000 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CCGTGCTTGG TATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

60 CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 
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GGTGTTCACT GGCATGACGG GTCACTACGT TCAGCTGGAG GACAdGGTGG 125J 
AGTCGTTCTC TGGCCTACTG ATGGGGTCAT ACGACCAGAT CCCGGAGA 12 91 



2) INFORMATION FOR SEQ ID NO: 53 7 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1297 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

15 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 



20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537 

CTTCTCGGAG GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG 50 

ACCTTGGCCG TGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC 10 0 

GCGAACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT 150 

2 5 GAAGTCGAAG GTTGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC 2 00 

GTGAGACGCT GGGCCGCATC TTCAACGTTC TGGGCGATGC GATCGACCAG 2 50 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC 3 00 

GAAGCTGGCG GATCAGGCCG CAGAGGACAC GATCCTGACG ACCGGCATCA 3 50 

AGGTGATCGA CTTGATCCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG 4 00 

3 0 TTCGGCGGTG CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA 4 50 

CAATGTCGCG AAGGGCCACG GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG 500 

AGCGCACGCG CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG 550 

GTGATTGACC TGAAGGGCGA GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT 60 0 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

35 TGGCGGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 70 0 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CCGCGCTGCT 7 50 
GGGCCGCATT CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC 800 
TTGGTATGCT GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG 850 
TCCGTGCAGG CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC 900 

4 0 CGCGACGACG TTCTCGCACC TGGATGCGAC GACTGTGCTG GACCGCGCGG 950 

TGGCGGAGTC GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG 10 00 

CGTATCATGG ACCCCGATGT GATCGATGTG GACCACTACA ACGTTGCGCA 1050 

GGATATCGTG CAGATGCTGA CCAAGTACAA GGAGCTGCAG GACATCATTG 1100 

CGGTGCTTGG CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTGGTGGAC 1150 

45 CGCGCGCGCA AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA 1200 

GGTGTTCACG GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG 1250 

AGTCGTTCTC TGGCCTGCTG ATGGGGTCGT ACGACCAGAT CCCGGAG 12 97 



50 



60 



2) INFORMATION FOR SEQ ID NO: 538 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 97 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538 

TTCTCGGAGG GCGTGCCGCC CGTGCTGACG GCGCTGGATG TGACGGAGGA 50 

CCTTGGCCGC GATGAGCCGC TGACGCTGGA GATCGTGCAG CACCTGGACG 100 

CGAACACCGG CCGCTGCATT GCGATGCAGA CGACGGACCT GTTGAAGCTG 150 

AAGTCGAAGG TTGTGTCGAC CGGCGGCAAC ATCTCTGTGC CGGTGGGCCG 2 00 

TGAGACGCTG GGCCGCATCT TCAACGTGCT GGGCGACGCG ATCGACCAGC 2 50 

GCGGCCCCGT GGGTGAGAAG ATGCGCATGG CGATCCACGC CGAGGCCCCG 3 00 

AAGCTGGCGG ATCAGGCCGC GGAGGACACG ATCCTGACGA CCGGCATCAA 3 50 

GGTGATCGAC CTGATTCTGC CCTACTGCAA GGGTGGCAAG ATCGGCCTGT 4 00 

TTGGTGGCGC CGGTGTGGGC AAGACCGTGA TCATCATGGA GTTGATTAAC 4 50 

AACGTCGCGA AGGGCCACGG TGGTTTCTCG GTGTTTGCCG GCGTTGGCGA 50 0 

GCGCACGCGC GAGGGCACGG ACCTGTACCT GGAGATGATG CAGTCGAAGG 550 

TGATTGACCT GAAGGGCGAG TCGAAGTGCG TGCTTGTGTA CGGGCAGATG 60 0 

AACGAGCCCC CGGGTGCGCG CGCGCGCGTT GCGCAGTCTG CGCTGACGAT 650 

GGCGGAGTAC TTCCGAGACG TGGAGGGCCA GAATGTGCTG CTGTTCATCG 70 0 

ACAACATCTT CCGCTTCACG CAGGCGAACT CCGAGGTGTC TGCGCTGCTG 7 50 

GGCCGCATTC CGGCCGCCGT GGGCTACCAG CCGACGCTTG CGGAGGATCT 80 0 

TGGTATGCTG CAGGAGCGCA TCACGTCGAC GACGAAGGGG TCGATCACGT 850 

CCGTGCAGGC CGTGTACGTG CCTGCGGATG ATAT CACGGA TCCGGCGCCC 90 0 

GCGACGACGT TCTCGCACCT GGACGCGACG ACTGTGCTGG ACCGCGCGGT 950 

GGCGGAGTCG GGGATCTACC CTGCCGTGAA CCCGCTGGAG TGCGCGTCGC 100 0 

GTATCATGGA CCCCGATGTG ATCGACGTGG ACCACTACAA CGTTGCGCAG 1050 

GATATCGTGC AGATGCTGAC CAAGTACAAG GAGCTGCAGG ATATCATTGC 1100 

GGTGCTTGGT ATCGACGAGC TGAGCGAGGA GGACAAGGTC GTGGTGGACC 1150 

GCGCGCGCAA GGTGACCCGG TTCCTGTCGC AGCCGTTCCA GGTTGCGGAG 12 00 

GTGTTCACGG GCATGACGGG CCACTACGTG CAGCTGGCCG ACACGGTGGA 12 50 

GTCGTTCTCT GGGCTGCTGA TGGGGTCGTA CGACCAGATC CCGGAGA 12 97 



2) INFORMATION FOR SEQ ID NO: 539 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53 9 
CCITACATCC TBGTYGCICT IAACAAG 



2) INFORMATION FOR SEQ ID NO: 54 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 0 
GGDGCITCYT CRTCGWAITC CTG 
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2) INFORMATION FOR SEQ ID NO: 541 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541 
GTKGAAATGT TCCGCAAGCT GCT 



2) INFORMATION FOR SEQ ID NO: 542 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542 
CGGAARTAGA ACTGSGGACG GTAG 



2) INFORMATION FOR SEQ ID NO: 543 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543 
ATCTTAGTAG TTTCTGCTGC TGA 



2) INFORMATION FOR SEQ ID NO: 544 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE : Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544 
AYGTTGTCGC CMGGCATTMC CAT 23 

2) INFORMATION FOR SEQ ID NO: 545 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 5 
TACATCCTBG TYGCICTIAA CAAGTG 2 6 

25 

2) INFORMATION FOR SEQ ID NO: 54 6 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54 6 
40 CCRCGICCGG TRATGGTGAA GAT 23 



2 ) INFORMATION FOR SEQ ID NO: 547 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547 

55 

GTACAGTTGC TTCAGGACGT ATC 23 



60 2 ) INFORMATION FOR SEQ ID NO: 548 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548 
ACGTTCGATT TCATCACGTT G 



2) INFORMATION FOR SEQ ID NO: 549 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549 
GAACGTGATA CTGACAAACC TTTA 



2) INFORMATION FOR SEQ ID NO: 550 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 550 
GAAGAAGAAC ACCAACGTTG 



2) INFORMATION FOR SEQ ID NO : 551 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551 
GAAGAAAAAA TCTTCGAACT GGCTA 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555 
5 TCTCGAACTT TCTCTATGTA TGCA 24 



2) INFORMATION FOR SEQ ID NO: 556 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 556 

20 

CGGCGCNATC YTSGTTGTTG C 21 



25 2) INFORMATION FOR SEQ ID NO: 557 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 557 

CCMAGGCATR ACCATCTCGG TG 22 



40 



50 



55 



2 ) INFORMATION FOR SEQ ID NO: 558 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558 
TCITTYAART AYGC I TGGGT 20 

2 ) INFORMATION FOR SEQ ID NO: 559 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 23 bases 

339 



SUBSTITUTE SHEET (RULE 26) 



10 



30 



60 



WO 01/23604 PCT/CA00/01150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559 
CCGACRGCRA YIGTYTGICK CAT 23 



2) INFORMATION FOR SEQ ID NO: 560 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560 
25 GAYTTCATYA ARAAYATGAT YAC 2 3 



2) INFORMATION FOR SEQ ID NO: 561 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 561 

40 

ACIGTICGGC CRCCCTCACG GAT 23 

45 2) INFORMATION FOR SEQ ID NO: 562 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 bases 

(B) TYPE: Nucleic acid 
50' (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562 

CARATGRAYG ARCCICCIGG IGYIMGIATG 3 0 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563 
GGYTGRTAIC CIACIGCIGA IGGCAT 



2 ) INFORMATION FOR SEQ ID NO: 564 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564 
TAYGG I CARA TGAAYGARCC ICCIGGIAA 



2) INFORMATION FOR SEQ ID NO: 565 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565 
GGYTGRTAIC CIACIGCIGA IGGDAT 



2) INFORMATION FOR SEQ ID NO: 566 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566 
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TTYGGIGGIG CIGGIGTIGG IAARAC 
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'26 



5 2 ) INFORMATION FOR SEQ ID NO: 567 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567 

TCRTCIGCIG GIACRTAIAY IGCYTG 26 



20 



30 



35 



2 ) INFORMATION FOR SEQ ID NO: 568 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 568 
RTIATIGGIG CIGTIRTIGA YGT 23 

2 ) INFORMATION FOR SEQ ID NO: 569 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569 
RTIRTIGGIS CIGTIRTIGA TAT 23 

50 



2 ) INFORMATION FOR SEQ ID NO: 570 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570 
RTIRYIGGIC CIGTIRTIGA YGT 



2) INFORMATION FOR SEQ ID NO: 571 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 571 
RTIRTIGGIC CIGTIRTIGA TGT 



2 ) INFORMATION FOR SEQ ID NO: 572 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572 
RTIRTIGGIS CIGTIRTIGA 



2 ) INFORMATION FOR SEQ ID NO: 573 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573 
CCICCIACCA TRTARAAI GC 



2 ) INFORMATION FOR SEQ ID NO: 574 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574 
ATIGCIATGG AYGGIACIGA RGG 



2) INFORMATION FOR SEQ ID NO: 575 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 5 
TIACCATTTC AGTACCTTCT GGTAA 



2) INFORMATION FOR SEQ ID NO: 576 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576 
AACTTCRTCA AGAAGGTYGG TTACAA 



2 ) INFORMATION FOR SEQ ID NO: 577 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 7 
CATGATTGAA CCATCCACCA 
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2 ) INFORMATION FOR SEQ ID NO : 578 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578 
CATGATTGAA GCTTCCACCA 2 0 



2 ) INFORMATION FOR SEQ ID NO: 579 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57 9 
GAAGGCCGTG CTGGTGAGAA 20 

30 



2) INFORMATION FOR SEQ ID NO: 580 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 580 
4 5 GCTAAACCAG CTACAATCAC TCCAC 25 



50 



2 ) INFORMATION FOR SEQ ID NO: 581 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581 

60 
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/22 



5 2 ) INFORMATION FOR SEQ ID NO: 582 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582 

TTTCAACTTC GTCGTTGACA CGAACAGT 28 



20 



30 



35 



60 



2) INFORMATION FOR SEQ ID NO : 583 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 35 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583 
CAACTGCTTT TTGGATATCT TCTTTAATAC CAACG 35 

2 ) INFORMATION FOR SEQ ID NO: 584 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584 
ACATGACACA TCTAAAACAA 2 0 

50 

2 ) INFORMATION FOR SEQ ID NO: 585 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 585 
ACCACATACT GAATTCAAAG 20 

2 ) INFORMATION FOR SEQ ID NO: 586 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586 

20 

CAGAAGTATA CGTATTATCA 2 0 

25 2 ) INFORMATION FOR SEQ ID NO: 587 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587 

CGTATTATCA AAAGACGAAG 2 0 



2 ) INFORMATION FOR SEQ ID NO: 588 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588 
TCTTCTCAAA CTATCGTCCA 2 0 

2 ) INFORMATION FOR SEQ ID NO: 58 9 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 2 0 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58 9 
GCACGAAACT TCTAAAACAA 2 0 



2) INFORMATION FOR SEQ ID NO: 590 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590 
25 TATACGTATT ATCTAAAGAT 2 0 



2) INFORMATION FOR SEQ ID NO : 591 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591 

40 

TCCTGGTTCT ATTACACCAC 2 0 



45 2) INFORMATION FOR SEQ ID NO: 592 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592 

CAAAGCTGAA GTATACGTAT 20 



60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593 
TTCACTAACT ATCGCCCACA 20 

2) INFORMATION FOR SEQ ID NO: 594 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

2 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594 
ATTGGTATCC ATGACACTTC 20 

30 

2 ) INFORMATION FOR SEQ ID NO: 595 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595 
45 TTAAAGCAGA CGTATACGTT 2 0 



2 ) INFORMATION FOR SEQ ID NO: 596 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY : . Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596 

60 
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120 



5 2 ) INFORMATION FOR SEQ ID NO: 597 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597 

ATTGGTATCA AAGAAACTTC 20 



20 



30 



35 
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2 ) INFORMATION FOR SEQ ID NO : 598 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598 
AATTACACCT CACACAAAAT 2 0 

2) INFORMATION FOR SEQ ID NO: 599 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599 
CGGTGAAGAA ATCGAAATCA 20 

50 

2 ) INFORMATION FOR SEQ ID NO: 600 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600 
5 ATGCAAGAAG AATCAAGCAA 2 0 



2) INFORMATION FOR SEQ ID NO: 601 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601 

20 

GTTTCACGTG ATGATGTACA 2 0 



25 2) INFORMATION FOR SEQ ID NO: 602 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602 

AAGTTGAAGT TGTTGGTATT 2 0 



40 

2) INFORMATION FOR SEQ ID NO: 603 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603 
GGTATTAAAG ACGAAACATC 2 0 

55 

2) INFORMATION FOR SEQ ID NO: 604 

(i) SEQUENCE "CHARACTERISTICS : 
60 (A) LENGTH: 2 0 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604 
GGTGATGAAG TAGAAATCGT 



2) INFORMATION FOR SEQ ID NO: 605 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605 
GAAATGTTCC GTAAATTATT 



2 ) INFORMATION FOR SEQ ID NO: 6 06 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 606 
ATTAGACTAC GCTGAAGCTG 



2) INFORMATION FOR SEQ ID NO : 607 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecal is 

(B) STRAIN: ATCC 29212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 607 
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CGGAGCTATC 
GTGAACATAT 
TTCTTAAACA 
AGAAATGGAA 
ATGTTCCAGT 
TCTTATGAAG 
CCCAACTCCA 
ACGTATTCTC 
CGTGGTGAAG 
CGAAACATCT 
TAGACTACGC 
GCACGTGAAG 
CACTCCACAC 
AAGGCGGACG 
TTCCGTACAA 
AATGGTAATG 
CAATCGCTAT 



TTAGTAGTTT 
CTTATTATCA 
AAATGGATAT 
GTTCGTGACT 
TATCGCAGGT 
AAAAAATCTT 
GAACGTGATA 
AATCACTGGA 
TTCGCGTTGG 
AAAACAACTG 
TGAAGCAGGC 
ATATCGAACG 
ACAAAATTCA 
TCACACTCCA 
CAGACGTTAC 
CCTGGTGATA 
CGAAGACGGA 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGATGAC 
TATTATCAGA 
TCTGCTTTGA 
AGAATTAATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACAGGTGT 
GACAACTTCG 
TGGACAAGTA 
AAGCTGAAGT 
TTCTTCACTA 
TGGTGTTGTA 
ACGTTGCTAT 
A 



TGGTCCTATG 
GTGT AC CAT A 
GAAGAATTAT 
ATACGATTTC 
AAGCTTTAGA 
GCTGCAGTTG 
ATTCATGATG 
TTGCTACAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGCTTTATT 
TTAGCTAAAC 
ATACGTATTA 
ACTACCGTCC 
GAATTGCCAG 
GGACGTTGAA 



PCT/CA00/01150 

CCTCAAACAC 56 

CATCGTTGTA 100 

TAGAATTAGT 150 

CCAGGCGATG 2 00 

AGGCGACGAG 2 50 

ACGAATATAT 3 00 

CCAGTCGAAG 3 50 

ACGTGTTGAA 4 00 

GTATTAAAGA 450 

CGTAAATTAT 50 0 

ACGTGGTGTA 550 

CAGCTACAAT 600 

TCAAAAGAAG 650 

TCAATTCTAC 70 0 

AAGGTACTGA 750 

TTAATTCACC 800 

821 



20 



2) INFORMATION FOR SEQ ID NO: 608 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 608 



CGGAGCTATC 
GTGAACACAT 
TTCTTGAACA 

4 0 TGAAATGGAA 
ATGTTCCTGT 
TCATACGAAG 
CCCAACTCCA 
ACGTGTTCTC 

45 CGTGGACAAG 
AGAAACTTCA 
TAGACYACGC 
GCACGTGAAG 
CACACCTCRT 

50 AAGGTGGACG 
TTCCGTACAA 
A 



TTGGTAGTTT 
CCTATTGTCT 
AAGTAGACAT 
GTTCGTGACC 
AGTTGCTGGA 
AAAAAATTCT 
GAACGTGACA 
AATTACTGGA 
TTCGCGTTGG 
AAAACAACAG 
TGAAGCTGGA 
ACATCCAACG 
ACAAAATTCT 
TCATACTCCA 
CTGACGTAAC 



CTGCTGCTGA 
CGTCAAGTTG 
GGTTGATGAC 
TATTAACAGA 
TCAGCTTTGA 
TGAATTAATG 
ACGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACTGGTGT 
GACRACATTG 
TGGACAAGTT 
CTGCAGAAGT 
TTCTTCACTA 
AGGTGTTGTT 



CGGCCCAATG 
GTGTTCCTTA 
GAAGAATTAC 
ATACRAATTC 
AAGCTCTAGA 
GCTGCAGTTG 
ATTCATGATG 
TTGCTACAGG 
GAAGTTGTTG 
TGAAATGTTC 
GTGCTTTACT 
TTAGCTAAAC 
ATACGTGTTG 
ACTACCGTCC 
GAATTACCAG 



CCTCAAACTC 50 

CATCGTTGTA 100 

TAGAATTAGT 150 

CCTGGTGRCG 2 00 

AGGCGACGCT 2 50 

ACGAATACAT 3 00 

CCAGTTGAAG 3 50 

TCGTGTTGAA 400 

GTATTGCTGA 4 50 

CGTAAATTGT 500 

ACGTGGTGTT 550 

CAGGTACAAT 600 

ACAAAAGAAG 650 

ACAATTCTAC 700 

AAGGAACTGA 750 

751 



55 



60 



2) INFORMATION FOR SEQ ID NO : 609 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 4 9573 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 609 

CGGTGCGATC TTAGTAGTAT CTGCTGCTGA CGGTCCTATG CCTCAAACTC 50 

GTGAACACAT CTTGTTATCA CGTAACGTTG GCGWACCATA CATCGTTGTT 100 

TTCTTGAACA AAATGGATAT GGTTGAYGAC GAAGAATTGC TAGAATTAGT 150 

15 TGAAATGGAA GTTCGTGACC TATTGTCTGA ATATGACTTC CCAGGCGACG 2 00 

ATGTTCCTGT AATCGCCGGT TCTGCTTTGA AAGCTCTTGA AGGAGATCCT 250 

TCATACGAAG AAAAAATCAT GGAATTGATG GCTGCAGTTG ACGAATACGT 3 00 

TCCAACTCCA GAACGTGATA CTGACAAACC ATTCATGATG CCAGTCGAAG 350 

ACGTATTCTC AATCACTGGA CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

2 0 CGTGGACAAG TTCGCGTTGG TGATGAAGTA GAAATCGTTG GTATTGCTGA 450 

CGAAACTGCT AAAACAACTG TAACAGGTGT TGAAATGTTC CGTAAATTGT 500 

TAGACTATGC TGAAGCAGGG GATAACATTG GTGCATTGCT ACGTGGGGTT 550 

GCTCGTGAAG ACATCCAACG TGGACAAGTA TTGGCTAAAG CTGGTACRAT 600 

CACACCTCAT ACAAAATTCA AAGCTGAAGT TTATGTTTTG ACAAAAGAAG 650 

25 AAGGTGGRCG TCACACTCCA TTCTTCACTA ACTACCGTCC TCAGTTCTAC 700 

TTCCGTACAA CTGACGTAAC TGGTGTTGTT GAATTACCAG AAGGAACTGA 750 



30 



40 



45 



A 



2) INFORMATION FOR SEQ ID NO: 610 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 891 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae Rd 

(B) STRAIN: KW20 

(C) ACCESSION NUMBER: extracted from U32739 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610 



751 



AATATGATTA CTGGTGCGGC ACAAATGGAT GGTGCTATTT TAGTAGTAGC 50 

AGCAACAGAT GGTCCTATGC CACAAACTCG TGAACACATC TTATTAGGTC 10 0 

50 GCCAAGTAGG TGTTCCATAC ATCATCGTAT TCTTAAACAA ATGCGACATG 150 

GTAGATGACG AAGAGTTATT AGAATTAGTC GAAATGGAAG TTCGTGAACT 200 

TCTATCTCAA TATGACTTCC CAGGTGACGA TACACCAATC GTACGTGGTT 250 

• CAGCATTACA AGCGTTAAAC GGCGTAGCAG AATGGGAAGA AAAAATCCTT 300 

GAGTTAGCAA ACCACTTAGA TACTTACATC CCAGAACCAG AACGTGCGAT 350 

55 TGACCAACCG TTCCTTCTTC CAATCGAAGA TGTGTTCTCA ATCTCAGGTC 4 00 

GTGGTACTGT AGTAACAGGT CGTGTAGAAC GAGGTATTAT CCGTACAGGT 4 50 

GATGAAGTAG AAATCGTCGG TATCAAAGAT ACAGCGAAAA CTACTGTAAC 500 

GGGTGTTGAA ATGTTCCGTA AATTACTTGA CGAAGGTCGT GCAGGTGAAA 550 

ACATCGGTGC ATTATTACGT GGTACCAAAC GTGAAGAAAT CGAACGTGGT 60 0 

60 CAAGTATTAG CGAAACCAGG TTCAATCACA CCACACACTG ACTTCGAATC 650 
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AGAAGTGTAC GTATTATCAA AAGATGAAGG TGGTCGTCAT ACTCCA^TCT 70 0 

TCAAAGGTTA CCGTCCACAA TTCTATTTCC GTACAACAGA CGTGACTGGT 75 0- 

ACAATCGAAT TACCAGAAGG CGTGGAAATG GTAATGCCAG GCGATAACAT 8 00 

CAAGATGACA GTAAGCTTAA TCCACCCAAT TGCGATGGAT CAAGGTTTAC 850 

GTTTCGCAAT CCGTGAAGGT GGCCGTACAG TAGGTGCAGG C 8 91 



2) INFORMATION FOR SEQ ID NO: 611 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611 

25 CGGCGGTATC TTAGTTGTAT CTGCTGCTGA CGGTCCAATG CCACAAACTC 50 

GTGAACACAT CTTATTATCA CGTAACGTTG GTGTACCAGC ATTAGTTGTA 100 

TTCTTAAACA AAGTTGACAT GGTAGACGAC GAAGAATTAT TAGAATTAGT 150 

TGAAATGGAA GTTCGTGACT TATTAAGCGA ATATGACTTC CCAGGTGACG 2 00 

ATGTACCTGT AATCGCTGGT TCTGCATTAA AAGCATTAGA AGGCGATGCT 2 50 

30 GAATACGAAC AAAAAATCTT AGACTTAATG CAAGCAGTTG ATGATTACAT 3 00 

TCCAACTCCA GAACGTGATT CTGACAAACC ATTCATGATG CCAGTTGAGG 350 

ACGTATTCTC AATCACTGGT CGTGGTACTG TTGCTACAGG CCGTGTTGAA 4 00 

CGTGGTCAAA TCAAAGTTGG TGAAGAAGTT GAAATCATCG GTATGCACGA 4 50 

AACTTCTAAA ACAACTGTTA CTGGTGTAGA AATGTTCCGT AAATTATTAG 500 

35 ACTACGCTGA AGCTGGTGAC AACATCGGTG CTTTATTACG TGGTGTTGCA 550 

CGTGAAGACG TACAACGTGG TCAAGTATTA GCTGCTCCTG GTTCTATTAC 600 

ACCACACACA AAATTCAAAG CTGAAGTATA CGTATTATCT AAAGATGAAG 650 

GTGGACGTCA CACTCCATTC TTCACTAACT ATCGCCCACA ATTCTATTTC 7 00 

CGTACTACTG ACGTAACTGG TGTTGTAAAC TTACCAGAAG GTACAGAAAT 750 

40 GGTTATGCCT GGCGACAACG TTGAAATGAC AGTTGAATTA ATCGCTCCAA 8 00 

TCGCTATCGA AGACGGAA 818 



45 2) INFORMATION FOR SEQ ID NO: 612 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Parathyphi A 

(B) STRAIN: ATCC 9150 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612 
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25 



30 
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40 



45 



50 



55 



60 



GGCGCGATCC 
TGAGCACATC 
TCCTGAACAA 
GAAATGGAAG 
CACGCCGATC 
AGTGGGAAGC 
CCGGAACCAG 
CGTATTCTCC 
GCGGTATCAT 
ACTCAGAAGT 
CGAAGGCCGT 
GTGAAGAAAT 
CCGCACACCA 
CGGCCGTCAT 
GTACTACTGA 
GTAATGCCGG 
CGCAATGGAC 



TGGTTGTTGC 
CTGCTGGGTC 
ATGCGACATG 
TTCGTGAACT 
GTTCGTGGTT 
GAAAATCATC 
AGCGTGCGAT 
ATCTCCGGTC 
CAAAGTGGGC 
CTACCTGTAC 
GCCGGTGAGA 
CGAACGTGGT 
AGTTCGAATC 
ACTCCGTTCT 
CGTGACTGGC 
GCGACAACAT 
GACGGTCTGC 



TGCGACTGAC 
GTCAGGTAGG 
GTTGATGACG 
TCTGTCTCAG 
CTGCTCTGAA 
GAACTGGCTG 
TGACAAGCCG 
GTGGTACCGT 
GAAGAAGTTG 
TGGCGTTGAA 
ACGTAGGTGT 
CAGGTACTGG 
TGAAGTGTAC 
TCAAAGGCTA 
ACCATCGAAC 
CAAAATGGTT 
GTTTC 



GGCCCGATGC 
CGTTCCGTAC 
AAGAGCTGCT 
TACGACTTCC 
AGCGCTGGAA 
GCTTCCTGGA 
TTCCTGCTGC 
TGTTACCGGT 
AAATCGTTGG 
ATGTTCCGCA 
TCTGCTGCGT 
CTAAGCCGGG 
ATTCTGTCCA 
CCGTCCGCAG 
TGCCGGAAGG 
GTTACCCTGA 



CGCAbACCCG 
ATCATCGTGT 
GGAACTGGTT 
CGGGCGACGA 
GGCGACGCAG 
TTCTTACATC 
CGATCGAAGA 
CGTGTAGAAC 
TATCAAAGAG 
AACTGCTGGA 
GGTATCAAAC 
CACCATCAAG 
AAGATGAAGG 
TTCTACTTCC 
CGTAGAGATG 
TCCACCCGAT 



2) INFORMATION FOR SEQ ID NO: 613 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia fx carl a 

(B) STRAIN: ATCC 33105 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613 



GGCGCTATCC 
TGAGCACATC 
TCATGAACAA 
GAAATGGAAG 
CCTGCCGGTG 
AGTGGGAAGC 
CCAGAACCAG 
CGTATTCTCC 
GCGGTATCAT 
ACCGTCAAGT 
CGAAGGCCGT 
GTGAAGACAT 
CCGCACACCC 
TGGTCGTCAC 
GTACTACTGA 
GTAATGCCTG 



TGGTTGTTGC 
CTGCTGGGTC 
ATGCGACATG 
TTCGCGAACT 
ATTCGCGGTT 
TAAAATCATC 
AGCGCGCTAT 
ATCTCCGGTC 
CAAAGTTGGC 
CTACCTGTAC 
GCCGGTGAGA 
CGAACGTGGT 
AGTTCGATTC 
ACKCCATTCT 
CGTGACCGGT 
GCGACAACGT 



TGCGACTGAC 
GYCAGGTTGG 
GTTGATGATG 
GCTGTCCGCT 
CCGCGCTGAA 
GAGCTGGCTG 
CGACAAGCCG 
GTGGTACCGT 
GAAGAAGTTG 
CGGCGTTGAA 
ACGTAGGTGT 
CAGGTTCTGG 
AGAAGTGTAC 
TCAAAGGCTA 
ACCATCGAAC 
GAACATGA 



GGCCCAATGC 
CGTTCCTTTC 
AAGAGCTGCT 
TACGACTTCC 
AGCGCTGGAA 
AMCMSCTGGA 
TTCCTGCTGC 
GGTTACCGGT 
AAATCGTTGG 
ATGTTCCGCA 
TCTGCTGCGT 
CTAAACCAGG 
ATCCTGAGCA 
CCGTCCACAG 
TGCCAGAAGG 



CTCAGACCCG 
ATCATCGTRT 
GGAACTGGTA 
CTGGCGATGA 
GGCGAAGCCG 
TACTTACATC 
CAATCGAAGA 
CGTGTTGAGC 
TATCAAAGAC 
AACTGCTGGA 
GGTATCAAGC 
TTCCATCAAG 
AAGAAGAAGG 
TTCTACTTCC 
CGTTGAGATG 



2) INFORMATION FOR SEQ ID NO: 614 



iuJo 

150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
825 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
778 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 653 bases 



356 



SUBSTITUTE SHEET (RULE 26) 



10 



WO 01/23604 PCT/CA00/01 150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614 



GTGCGATCTT AGTAGTATCA GCTACTGATG GTCCAATGCC TCAAACTCGT 50 

GAACACATTT TGTTATCACG TCAAGTTGGT GTTAAGCACT TGATCGTTTT 100 

15 CTTGAACAAA GTAGATTTAG TTGATGACGA AGAATTGATC GACTTAGTTG 150 

AAATGGAAGT ACGTGAATTA CTTTCTGAAT ATGGTTTCCC AGGTGATGAT 2 00 

ATTCCAGTGC TTAAAGGTTC TGCTTTGAAA GCATTAGAAG GCGATCCAGA 2 50 

ACAAGAACAA GTTATTCTTG ATTTGATGGA TACCGTTGAT GAATATATCC 3 00 

CAACACCTGA ACGTGACAAT GACAAACCGT TCTTGTTACC AGTTGAGGAT 350 

2 0 GTTTTCTCGA TCACAGGACG TGGTACTGTA GCTTCTGGTC GTATCGACCG 4 00 

TGGCGAAGTT AAAGTCGGCG ATGAAATTGA AATCATCGGG ATCAAACCTG 4 50 

AAGTTCAAAA AGCAATCGTT ACTGGACTTG AAATGTTCCG TAAAACATTG 50 0 

GATTATGGTG AAGCTGGCGA TAACGTTGGG GTTCTATTAC GTGGGATTAC 550 

ACGTGATGAA ATCGAACGTG GCCAAGTATT AGCTAAACCA GGTTCAATCA 600 

2 5 CACCACATAC TAAGTTCAAA GCCGAAGTAT ATGTGTTGAC GAAAGAAGAA 650 

GGT 653 



30 2 ) INFORMATION FOR SEQ ID NO: 615 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 9 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus durans 

(B) STRAIN: ATCC 19432 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615 



CCATTCTAGT TGTATCTGCA ACAGATGGAC CAATGCCACA AACACGTGAA 50 

CATATTTTAT TGTCACGTCA AGTAGGTGTT AAATATTTGA TCGTCTTCTT 100 

GAACAAAATC GACTTAGTAG ATGATGAAGA ATTGATTGAT CTTGTCGAAA 150 

TGGAAGTTCG TGAATTATTA AGCGAATATG GTTTCCCAGG TGACGATACA 200 

50 CCAGTCATCA AAGGTTCAGC ATTAAAAGCT TTACAAGGAG ATCCTGATGC 2 50 

AGAAGCAGCT ATCATGGAAT TGATGGATAC TGTTGATGAA TATATCCCAA 3 00 

CACCAGAACG TGATACAGAC AAACCATTAT TGTTACCAGT GGAAGATGTC 3 50 

TTCTCAATCA CAGGTCGTGG GACTGTTGCT TCAGGTCGTA TCGATCGTGG 4 00 

TGCAGTTCGT GTAGGTGATG AAATCGAAAT CGTCGGTATC AAACCTGAAA 4 50 

55 CACAAAAAGC TGTTGTAACT GGGGTCGAAA TGTTCCGCAA GACATTAGAC 500 

TATGGTGAAG CAGGAGATAA CGTTGGGGTA TTGTTACGTG GTATCCAACG 550 

TGAAGATATC GAACGTGGAC AAGTAATCGC AAAACCAGGT TCAATCACAC 600 

CACATACAAA ATTCAAAGCA GAAGTGTACG TATTGACAAA AGAAGAAGGT 650 

GGACGTCATA CACCATTCTT CAATAACTAT CGTCCACAAT TCTACTTCCG 700 

60 TACAACTGAC GTAACTGGAA CAATCGTTTT ACCTGGAGGC ACTGAAATGG 750 

357 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

TTATGCCTGG AGATAACGTA ACGATCGACG TTGAATTGAT CCATCX^AGTT 8QO 
GCCATCGAAA ACGGAACAAC TTTCTCTAT 82 9 



2) INFORMATION FOR SEQ ID NO: 616 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 669 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus pseudoavium 

(B) STRAIN: ATCC 49372 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616 

GGTGCAATTT TAGTAGTATC TGCTACTGAT GGCCCAATGC CACAAACACG 50 

TGAACATATC TTGTTATCAC GTCAAGTAGG GGTTAAACAC TTAATCGTCT 100 

TCTTGAACAA AGTTGATTTA GTTGATGATG AAGAATTGAT CGATTTAGTT 150 

2 5 GAAATGGAAG TTCGGGAATT GCTTTCTGAA TATGGTTTCC CAGGCGATGA 2 00 

TATTCCAGTA CTTAAAGGTT CTGCTTTGAA AGCTTTAGAA GGCGATCCTG 2 50 

AACAAGAACA AGTAATCCTT GACTTGATGG ATACGGTTGA TGAATACATC 3 00 

CCAACGCCTG AACGTGATAC TGACAAACCA TTCTTGTTAC CAGTCGAAGA 350 

TGTCTTCTCA ATCACAGGAC GTGGTACGGT TGCATCTGGT CGTATCGATC 4 00 

3 0 GTGGGGAAGT TAAAGTCGGT GATGAAGTTG AAATCATCGG GATCAAACCT 450 

GAAGTGCAAA AAGCTGTCGT AACTGGACTA GAAATGTTCC GTAAGACATT 500 

GGATTACGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CTCGTGATGA AATCGAACGT GGACAAGTAT TAGCTAAACC AGGTTCAATC 60 0 

ACTCCACATA CGAAATTCAG TGCAGAAGTT TATGTATTGA CGAAAGAAGA 650 

35 AGGTGGCCGT CATACGCCA 66 9 



40 



2) INFORMATION FOR SEQ ID NO: 617 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 51266 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617 

55 CGGGGCAATT TTAGTTGTAT CTGCAACTGA TGGCCCAATG CCACAAACAC 50 

GTGAACACAT TTTGTTAGCT CGTCAAGTAG GGGTTAAATA TTTAATCGTC 100 

TTCTTGAACA AAACAGATTT AGTTGATGAT GAAGAATTAT TGGAACTAGT 150 

TGAAATGGAA GTTCGTGAAT TATTAAATGA ATACAATTTC CCTGGCGATG 200 

ATATTCCTGT TATTCGCGGA TCTGCTTTAA AAGCATTAGA AGGCGATCCA 250 

60 GAACAAGAAG AAGTAATTAT GAACTTGATG GAT AC TGTGG ATGAATATAT 3 00 

358 
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CCCAACTCCA GAACGTGACA ATGATAAACC 
ATGTCTTCAC AATTACTGGT CGTGGTACTG 
CGTGGTAAAG TCAACGTTGG TGATGAAATT 
AGAAACACAA AAAGCTGTTG TAACCGGTTT 
5 TGGATTATGG TGAAGCTGGT GATAACGTTG 
ACTCGTGATG AAGTAGAACG TGGTCAAGTA 
TACACCGCAT ACCAAATTTA AAGGTGAAGT 
AAGGTGGACG TCATACTCCT TTCTTTAATA 
TTCCGTACAA CTGATGTGAC TGGTAACATC 
10 AATGGTAATG CCTGGTGATA ATGTAACAAT 
CAATCGCCGT TGAAAAAGGG ACTACTTTCT 



PCT/CA00/01150 

ATTCTTGTTA CCAC^QGAAG 3 5Qf 

TTGCTTCAGG TCGTATCGAC 400 

GAAATTATCG GAATTAAACC 4 50 

GGAAATGTTC CGTAAAACTT 500 

GGGTCTTATT ACGTGGGATT 550 

TTAGCAAAAC C AGGTTC CAT 600 

TTATATCTTA ACAAAAGAAG 650 

ACTATCGTCC TCAATTTTAT 700 

GCATTACCTG AAGGAACTGA 750 

TGAAGTTGAA TTGATTCATC 800 

GAATT 835 



15 2) INFORMATION FOR SEQ ID NO: 618 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14 02 5 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 618 



GGTGCAATCC TAGTAGTATC AGCTACTGAT GGTCCAATGC CGCAAACACG 50 

TGAACATATT TTGCTATCAC GGCAAGTGGG TGTTAAACAC TTAATCGTAT 10 0 

TTTTAAACAA AGTTGATTTA GTCGATGATG AAGAATTGAT CGATCTAGTT 150 

GAAATGGAAG TCCGTGAATT ACTTTCTGAA TATGGTTTCC CAGGTGACGA 20 0 

3 5 TATTCCAGTT CTCAAAGGTT CAGCTTTGAA AGCATTAGAA GGCGATCCTG 2 50 

AACAAGAACA AGTAATCCTT GATTTAATGG ATACAGTTGA CGAATATATC 300 

CCAACTCCAG AACGTGACAC TGACAAGCCA TTCTTGTTAC CAGTCGAAGA 3 50 

TGTATTTTCT ATCACTGGTC GTGGGACTGT AGCGTCTGGA CGGATTGATC 400 

GTGGTGAAGT TAAAGTCGGC GATGAAGTTG AAATCATCGG GATCAAACCT 450 

40 GAAATTCAAA AAGCAGTCGT AACTGGACTT GAAATGTTCC GTAAAACTTT 500 

AGATTATGGT GAAGCTGGCG ATAACGTTGG GGTTCTATTA CGTGGGATTA 550 

CACGTGATGA AATCGAACGT GGTCAAGTCT TAGCTAAACC AGGTTCAATC 600 

ACACCACATA CAAAATTCAG TGCAGAAGTT TACGTATTGA CGAAAGAAGA 650 

AGGTGGACGT CATACACCAT CTT 673 

45 



2) INFORMATION FOR SEQ ID NO: 619 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1713 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sac char omyces cerevisiae 
60 (C) ACCESSION NUMBER: K0042 8 
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PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619 

TTTCATATTT TTAAGGATTT TGTTTTAGCA CCCATCCGAC CTCAGTCAAT 50 

5 ATATCCTTTC GCGACCAGGC TTTCCTCCCT TTTGCTGCTA ACTGGTTACA 100 

GATTTTCCTA TTTTTGGTCA TTTTTATCTT TGAAACTGAT TAAGCTGAAA 150 

AAATTTGAGC TTCTTTGTTG TAAACTATTT TGTGCTTTCA GTTTTATTCT 2 00 

AGCTCGACAA AGGTAACAGA CAAAAATGTC AGCTTTATTA CCAAGATTAC 2 50 

TCACAAGAAC AGCTTTTAAA GCTTCTGGGA AACTTCTGAG GCTCTCTTCA 3 00 

10 GTAATTTCTA GGACCTTTTC TCAAACTACT ACTTCCTATG CAGCTGCTTT 3 50 

TGATCGTTCC AAACCGCATG TAAATATAGG TACGATCGGC CATGTTGATC 400 

ATGGGAAGAC AACTTTAACC GCAGCCATTA CGAAAACGTT AGCCGCAAAA 450 

GGTGGTGCCA ACTTCTTGGA CTATGCTGCC ATCGATAAGG CTCCGGAAGA 500 

AAGAGCTCGT GGTATTACAA TTTCTACTGC ACACGTGGAA TACGAAACGG 550 

15 CCAAGAGACA TTATTCTCAC GTCGACTGTC CAGGCCACGC TGATTACATC 600 

AAGAATATGA TTACCGGTGC TGCTCAAATG GATGGTGCTA TCATTGTTGT 650 

AGCTGCTACC GATGGACAAA TGCCCCAAAC TAGAGAACAT TTACTTTTGG 700 

CCAGACAAGT TGGTGTCCAA CATATTGTCG TTTTTGTTAA CAAGGTTGAT 750 

ACCATTGATG ATCCAGAAAT GTTAGAGTTA GTCGAAATGG AAATGAGAGA 800 

2 0 ACTTTTAAAC GAATATGGGT TTGACGGTGA TAATGCTCCA ATTATCATGG 850 

GTTCTGCCCT TTGCGCTTTG GAAGGTCGCC AACCTGAAAT TGGGGAGCAG 900 

GCCATCATGA AACTTTTGGA TGCAGTGGAT GAGTATATTC CTACACCTGA 950 

AAGAGATTTG AACAAGCCTT TCTTGATGCC CGTTGAAGAT ATCTTCTCTA 1000 

TCTCCGGTAG AGGTACTGTG GTCACTGGTC GTGTGGAAAG GGGTAATTTA 1050 

25 AAGAAAGGTG AGGAATTGGA AATTGTTGGT CACAACTCCA CCCCATTGAA 1100 

AACAACAGTT ACTGGTATTG AAATGTTTAG AAAGGAATTG GACTCTGCTA 1150 

TGGCAGGTGA CAATGCCGGT GTTTTACTTA GAGGTATCAG GAGAGATCAA 12 00 

TTGAAGAGAG GTATGGTCTT AGCTAAGCCA GGTACCGTTA AAGCCCATAC 12 50 

AAAGATTCTA GCCTCTTTGT ACATTTTATC CAAAGAGGAA GGTGGTAGAC 13 0 0 

30 ATTCTGGGTT TGGTGAAAAC TACAGACCAC AAATGTTTAT AAGAACAGCT 13 50 

GATGTTACAG TTGTGATGAG ATTTCCTAAG GAGGTTGAAG ATCATTCTAT 14 00 

GCAAGTTATG CCAGGTGACA ATGTTGAAAT GGAATGTGAT TTGATCCATC 14 50 

CTACCCCATT AGAAGTTGGT CAACGTTTCA ATATCAGAGA GGGTGGAAGA 1500 

ACTGTTGGTA CCGGTCTAAT CACACGTATT ATTGAATAGA CTTATTGATG 1550 

35 CAACTGGAGT ATATTTCTAT ATATTCTGTT CATTTCCCCT CTCATAATAT 1600 

ATACTTGTTT CGTTAAAATT TTATACGTGT AAATAAAGTG CCATAAATTT 1650 

TTCAGCTTTA CTTTTGGTAG AGTCCTGCTA GCACTAGATT TTACAATTTC 1700 

ATGTGCACAC ACC 1713 

40 

2) INFORMATION FOR SEQ ID NO: 620 

( i ) SEQUENCE CHARACTER I STICS : 
45 (A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620 
ATTGGTGCAT TGCTACGT 18 

55 



2) INFORMATION FOR SEQ ID NO: 621 

60 (i) SEQUENCE CHARACTERISTICS: 

360 
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(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
10 (B) STRAIN: ATCC 19434 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621 

TGGTGCAATC TTAGTTGTTT CTGCAACTGA CGGTCCGATG CCTCAAACAC 50 

15 GTGAACACAT TTTATTGTCA CGCCAAGTTG GTGTAAAATA CCTGATTGTT 100 

TTCTTGAACA AAGTTGATTT AGTCGATGAT GAAGAATTGA TCGATTTGGT 150 

AGAAATGGAA GTTCGCGAGT TATTGAGCGA ATATGGTTTC CCAGGCGATG 2 00 

ACACTCCTGT GAT CAAAGGT TCCGCATTAA AAGCATTGCA AGGCGATCCA 2 50 

GATGCTGAAG CTGCTATTAT GGAATTGATG GATACAGTAG ATGAATATAT 3 00 

2 0 CCCAACACCA GAACGTGATA CAGATAAACC ATTACTATTG CCAGTGGAAG 3 50 

ACGTCTTCTC AATTACAGGT CGAGGAACTG TTGCCTCAGG TCGTATTGAT 4 00 

CGTGGTGCTG TTCGTGTCGG TGATGAGGTA GAGATCGTAG GGATCAAACC 4 50 

TGAAACACAA AAAGCAGTTG TAACAGGTGT AGAAATGTTC CGTAAAACGT 50 0 

TAGATTACGG GGAAGCTGGG GATAACGTAG GCGTGTTGTT ACGGGGGATC 550 

25 CAACGTGACG ATATCGAACG TGGACAAGTA CTTGCTAAAC CAGGTTCCAT 60 0 

TACTCCACAT ACAAAATTCA AAGCAGAAGT GTACGTGTTG ACAAAAGAAG 650 

AAGGTGGACG TCATACTCCA TTCTTCAACA ACTATCGTCC ACAGTTCTAC 7 00 

TTCCGCACAA CTGATGTTAC AGGAACAATC ACATTGCCAG AAGATACAGA 750 

A 751 

30 



2) INFORMATION FOR SEQ ID NO: 622. 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
45 (B) STRAIN: ATCC 13264 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622 

GTCAAATGGG ACGAATCCAG ATTCCAAGAA ATTGTCAAGG AAACCTCCAA 50 

50 CTTTATCAAG AAGGTTGGTT ACAACCCAAA GACTGTTCCA TTCGTCCCAA 100 

TCTCTGGTTG GAACGGTGAC AACATGATTG AAGCTACCAC CAACGCTCCA 150 

TGGTACAAGG GTTGGGAAAA GGAAACCAAG GCCGGTGTCG TCAAGGGTAA 200 

GACTTTGTTG GAAGCCATTG ACGCCATTGA ACAACCATCT AGACCAACTG 250 

ACAAGCCATT GAGATTGCCA TTGCAAGATG TTTACAAGAT TGGTGGTATT 3 00 

55 GGTACTGTGC CAGTCGGTAG AGTTGAAACC GGTGTCATCA AGCCAGGTAT 350 

GGTTGTTACT TTCGCCCCAG CTGGTGTTAC CACTGAAGTC AAGTCCGTTG 400 

AAATGCATCA CGAACAATTG GAACAAGGTG TTCCAGGTGA CAACGTTGGT 450 

TTCAACGTCA AGAACGTTTC CGTTAAGGAA ATCAGAAGAG GTAACGTCTG 500 

TGGTGACGCT AAGAACGATC CACCAAAGGG TTGCGCTTCT TTCAACGCTA 550 

60 CCGTCATTGT TTTGAACCAT CCAGGTCAAA TCTCTGCTGG TTACTCTCCA 600 
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GTTTTGGATT GTCACACTGC TCACATTGCT 
GGAAAAGAAC GACAGAAGAT CTGGTAAGAA 
TCTTGAAGTC CGGTGACGCT GCTTTGGTCA 



TGTAGATTCG ACGAATTGTT 
GTTGGAAGAC CATCCAAAGT 
AGTTCGTTCC ATCTAAGCCA 



65/) 
700 
750 



2) INFORMATION FOR SEQ ID NO: 623 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

( i i ) MOLECULE . TYPE : Genomi c DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neofoinnans 

(B) STRAIN: ATCC 44104 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623 

TCTTGAAAGC TTAAGGCCGA GCGAGAGCGA GGTATCACCA TCGACATTGC 50 

TCTTTGGAAG TTCGAGACCC CCAGGTACCA GGTCACCGTC ATTGACGCCC 100 

CCGGTCACCG AGACTTCATC AAGAACATGA TCACCGGTAC CTCCCAGGCT 150 

GACTGTGCCA TCCTCATCAT TGCCACCGGT ATCGGTGAGT TCGAGGCCGG 2 00 

TATCTCCAAG GACGGTCAGA C CCGAGAGC A CGCCCTCCTC GCCTTCACCC 2 50 

TCGGTGTCAG GCAGCTCATT GTTGCTTGCA ACAAGATGGA CACCTGCAAG 3 00 

TGGTCCGAGG ACCGATTCAA CGAAATCGTC AAGGAGACCA ACGGTTTCAT 3 50 

CAAGAAGGTT GGTTACAACC CCAAGGCTGT CCCCTTCGTC CCCATCTCTG 4 00 

GTTGGCACGG TGACAACATG TTGGAGGAGA CCACCAAGTC AGTGAATCCG 450 

CTTCTACGTG ATGAGATGTT TTTCTGACTT TCCGTGCAGC ATGCCCTGGT 500 

ACAAGGGATG GACCAAGGAG ACCAAGTCTG GTGTTTCCAA GGGTAAGACC 550 

CTCCTCGAGG CCATCGACGC CATCGAGCCC CCTACCCGAC CCACCGACAA 600 

GCCCCTCCGT CTCCCTCTCC AGGACGTCTA CAAGATCGGT GGTATCGGCA 650 

CAGTCCCTGT CGGCCGAGTC GAGACCGGTG TCATCAAGGC CGGTATGTTG 70 0 

TCTCATCTCT CTTGTCTCGT AACATGCGTC TCGTAACATG CGCTTACTTC 750 

ATTTTCAGGT ATGGTCGTCA AGTTCGCCCC CACCAACGTC ACCACTGAAG 800 

TCAAGTCCGT TGAGATGCAC CACGAGCAGA TCCCCGAGGG TCTTCCCGGA 850 

GACAACGTTG GTTTCAACGT CAAGAACGTT TCCATCAAGG ACATCCGACG 900 

AGGTAACGTC TGTGGTGACT CCAAGAACGA CCCCCCTATG GAGGCTGCTT 950 

CTTTCAACGC CCAGGTTATC GTCCTTAACC ACCCTGGTCA GATCGGTGCC 10 00 

GGTTACACCC CCGTTCTCGA CTGTCACACT GCCCGTAAGC CTGACCCAAT 1050 

ACCTCCAACA TACCTTTGAA GCTGACCCTT TCTAGACATT GCCTGCAAGT 1100 

TTGCTGAGTT GATCGAGAAG ATTGACCGAC GAACCGGTAA GGTCATGGAG 1150 

GCCGCCCCCA AGTTCGTCAA GTCTGGTGAC GCCGCCATTG TCAAGCTTGT 12 00 

TGCCCAGAAG CCCCTCTGTG TTGAGACCTA CGCCGACTAC CCCCCTCTTG 1250 

GTCGATTCGC CGTCCGAGA 1269 



2) INFORMATION FOR SEQ ID NO: 624 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 36801 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624 

TCTGTCAAAT GGGACAAAAA CAGATTTGAA GAAATCATCA AGGAAACCTC 50 

CAACTTCGTC AAGAAGGTTG GTTACAACCC AAAGACTGTT CCATTCGTTC 100 

10 CAATCTCTGG TTGGAATGGT GACAACATGA TTGAACCATC CACCAACTGT 150 

CCATGGTACA AGGGTTGGGA AAAGGAAACC AAATCCGGTA AAGTTACTGG 2 00 

TAAGACCTTG TTAGAAGCTA TTGACGCTAT TGAACCACCA ACCAGACCAA 250 

CCGACAAACC ATTGAGATTG CCATTGCAAG ATGTTTACAA GATCGGTGGT 3 00 

ATTGGTACTG TGCCAGTCGG TAGAGTTGAA ACTGGTATCA TCAAAGCCGG 350 

15 TATGGTTGTT ACTTTCGCCC CAGCTGGTGT TACCACTGAA GTCAAATCCG 400 

TTGAAATGCA TCACGAACAA TTGGCTGAAG GTGTTCCAGG TGACAATGTT 450 

GGTTTCAACG TTAAGAACGT TTCCGTTAAA GAAATTAGAA GAGGTAACGT 500 

TTGTGGTGAC TCCAAGAACG ATCCACCAAA GGGTTGTGAC TCTTTCAATG 550 

CCCAAGTCAT TGTTTTGAAC CATCCAGGTC AAATCTCTGC TGGTTACTCT 600 

20 CCAGTCTTGG ATTGTCACCC TGCCCACATT GCTTGTAAAT TCGACACTTT 650 

GGTTGAAAAG ATTGACAGAA GAACTGGTAA GAAATTGGAA GAAAATCCAA 700 

AATTCGTCAA ATCCGGTGAT GCTGCTATCG TCAAGATGGT CCCAACCAAA 750 

CCA 753 



25 



2 ) INFORMATION FOR SEQ ID NO: 625 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625 
CGTTGAAGAC ACGACCCAAA GTATCC 26 

40 

2 ) INFORMATION FOR SEQ ID NO: 626 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626 
55 TACCACCTTT TAAGTAAGGT GCTAAT 26 



2 ) INFORMATION FOR SEQ ID NO: 62 7 

60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627 
ATTGTCTATA AAAATGGCGA TAAGTC 



2) INFORMATION FOR SEQ ID NO: 628 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 8 
AAAATGGCGA TAAGTCACAA AAAGTA 



2) INFORMATION FOR SEQ ID NO: 629 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 9 
AAGTTCCATC TCAACAAGGT CAATA 



2) INFORMATION FOR SEQ ID NO: 63 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 0 
CGGAGCTATC CTAGTCGTTT CA 
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2) INFORMATION FOR SEQ ID NO: 631 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631 
15 CAGACCAACY GAIAARCCAT TRAGAT 2 6 



2) INFORMATION FOR SEQ ID NO: 632 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

2 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632 

30 

CCCTTTGGTG GRTCSTKCTT GGA 23 

3 5 2) INFORMATION FOR SEQ ID NO: 633 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633 

CAGACCAACY GAIAARCCIT TRAGAT 26 



2) INFORMATION FOR SEQ ID NO: 634 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634 

AACACYGTCA GRRCIATTGC YATGGA 



2) INFORMATION FOR SEQ ID NO: 635 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 5 
AAACCRGTIA RRGCRACTCT IGCTCT 



2) INFORMATION FOR SEQ ID NO: 63 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 6 
ACTGGYGTTG AIATGTTCCG YAA 



2) INFORMATION FOR SEQ ID NO: 63 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637 
ACGTCAGT I G TACGGAARTA GAA 



2) INFORMATION FOR SEQ ID NO: 638 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638 
CCAATGCCAC AAACICGTGA RCACAT 26 

2) INFORMATION FOR SEQ ID NO: 63 9 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639 
TTTACGGAAC ATTTCWACAC CWGTIACA 28 

2) INFORMATION FOR SEQ ID NO: 64 0 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 0 
TCCATGGTIT WYGGI CARAT GAA 23 

40 

2) INFORMATION FOR SEQ ID NO: 641 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641 
55 TGATAACCWA CIGCIGAIGG CATACG 26 



60 



2 ) INFORMATION FOR SEQ ID NO: 642 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642 
GGCGTIGGIG ARCGIACICG TGA 



2) INFORMATION FOR SEQ ID NO: 643 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643 
ACTGGIGTIG ARATGTTCCG YAA 



2) INFORMATION FOR SEQ ID NO: 644 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644 
ACGTCIGTIG TICKGAARTA GAA 



2) INFORMATION FOR SEQ ID NO: 64 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 5 
ACGTCIGTIG TICKGAARTA RAA 
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2 ) INFORMATION FOR SEQ ID NO: 64 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646 
15 ATCGACAAGC CITTCYTIAT GSC 23 



2) INFORMATION FOR SEQ ID NO: 64 7 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647 

30 

ACGTCCGTSG TRCGGAAGTA GAACTG 2 6 



35 2 ) INFORMATION FOR SEQ ID NO: 648 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 648 

ACGTCSGTSG TRCGGAAGTA GAACTG 



50 



60 



2 ) INFORMATION FOR SEQ ID NO: 64 9 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 6 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64 9 

GTCCTATGCC TCARACWCGI GAGCAC 2 6 



2 ) INFORMATION FOR SEQ ID NO: 650 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650 
TTACGGAACA TYTCAACACC IGT 23 

20 

2 ) INFORMATION FOR SEQ ID NO: 651 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651 
35 TGACGACCAC CITCYTCYTT YTTCA 25 



40 



2 ) INFORMATION FOR SEQ ID NO: 652 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652 

50 

CCWAYAGTIY KICCICCYTC YCTIATA 27 



55 2 ) INFORMATION FOR SEQ ID NO: 653 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653 
GAYTTCAT I A ARAAYATGAT 2 0 

2 ) INFORMATION FOR SEQ ID NO: 654 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654 
TACAARATYK GIGGTATYGG 20 

2 ) INFORMATION FOR SEQ ID NO : 655 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655 
CCRATACCIC MRATYTTGTA 20 

40 

2 ) INFORMATION FOR SEQ ID NO: 656 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656 
55 AATTAATGGC TGCAGTTGAY GA 22 



60 



2 ) INFORMATION FOR SEQ ID NO: 657 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657 
TTGTCCACGT TCGATRTCTT CA 



2) INFORMATION FOR SEQ ID NO: 658 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 658 
GATYTAGTCG ATGATGAAGA ATT 



2) INFORMATION FOR SEQ ID NO: 659 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659 
GCTTTTTG I G TTTCWGGTTT RAT 



2) INFORMATION FOR SEQ ID NO: 660 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 660 
GTAGAATTGA GGACGGTAGT TAG 
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2) INFORMATION FOR SEQ ID NO: 661 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 661 
15 GTAGAAYTGT GGWCGATART TRT 23 



2) INFORMATION FOR SEQ ID NO: 662 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: CoryrzeJbacterium diphtherial 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662 

3 5 CGGCGCAATC CTCGTTGTTG CTGCCACCGA CGGCCCAATG CCTCAGACCC 50 

GTGAGCACGT TCTGCTCGCT CGCCAGGTCG GCGTTCCTTA CATCCTCGTT 100 

GCTCTGAACA AGTGCGACAT GGTTGATGAT GAGGAAATCA TCGAGCTCGT 150 

CGAGATGGAG ATCCRTGAGC TGCTCGCTGA GCAGGATTAC GACGAAGAGG 2 00 

CTCCAATCAT CCACATCTCC GCAC TGAAGG CTCTTGAGGG CGACGAGAAG 2 50 

4 0 TGGACCCAGT CCATCATCGA CCTCATGCAG GCTTGCKATG ATTCCATCCC 3 00 

AGACCCAGAG CGTGAGACCG ACAAGCCATT CCTCATGCCT ATCGAGGACA 3 50 

TCTTCACCAT CACCGGCCGC GGTACCGTTG TTACCGGCCG TGTTGAGCGT 4 00 

GGCTCCCTGA AGGTCAACGA GGACGTCGAG ATCATCGGTA TCCGCGAGAA 450 

KGCTACCACC ACCACCGTTA CCGGTATCGA GATGTT C CGT AAGCTTCTCG 500 

45 ACTACACCGA GGCTGGCGAC AACTGTGGTC TGCTTCTCCG TGGCGTTAAG 550 

CGCGAAGACG TTGAGCGTGG CCAGGTTGTT GTTAAGCCAG GCGCTTACAC 60 0 

CCCTCACACC GAGTTCGAGG GCTCTGTCTA CGTTCTGTCC AAGGACGAGG 650 

GTGGCCGCCA CACCCCATTC TTCGACAACT ACCGCCCACA GTTCTACTTC 700 

CGCACCACCG ACGTTACCGG TGTTGTGAAG CTTCCTGAGG GCACCGAGAT 75 0 

50 GGTCATGCCT GGCGACAACG TCGACATGTC CGTCACCCTG ATCCAGCCTG 800 

TCGCTATGGA TGAGGGCCTG CGCTTCGCTA TC 832 



55 2) INFORMATION FOR SEQ ID NO: 663 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1192 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(ii) MOLECULE TYPE: Genomic DNA 



10 



15 



20 



25 



30 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida catenulata. 

(B) STRAIN: ATCC 10565 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663 



AACGGCGAGC 
CACCGTGCGT 
CCGCTGTCAC 
ACCTTGGGCC 
TCCCATCCAG 
TCACCGAGCA 
GTCGACTTGT 
TGGTGCCGGT 
TTGCCAAGGC 
ACTCGTGAGG 
CAACTTGGAG 
AGCCCCCGGG 
GAGTACTTCC 
CATTTTCAGA 
GTATCCCCTC 
TTGTTGCAGG 
GCAGGCCGTG 
CCACTTTCGC 
GAGTTGGGTA 
GTTGGACGTC 
TCCAGGAGTG 
TTGGGTATGG 
CCGTAAGATC 
TCACTGGTAT 



AAGACTTGGT 
ACCATTGCCA 
TGACACCGGT 
GGATCATCAA 
TGCAAGCAGA 
GTCCGTCGAG 
TGGCTCCCTA 
GTCGGTAAGA 
CCACGGTGGT 
GTAACGACTT 
GGCGACTCCA 
GGCTCGTGCC 
GTGACGAGGA 
TTCACCCAGG 
CGCCGTCGGT 
AGAGAATTAC 
TACGTCCCTG 
TCACTTGGAC 
TCTACCCCGC 
GAGGTTGTTG 
CTTGCAGGCC 
ACGAGTTGTC 
CAGCGTTTCT 
CCCCGGTAGA 



GTTGGAGGTT 
TGGACGGTAC 
GCTCCCATCT 
CGTTGTCGGT 
GAAACCCCAT 
GCTGAGGTGT 
CGCCCGTGGT 
CCGTGTTCAT 
TTCTCCGTGT 
GTACCGTGAA 
AGGTGGCCTT 
CGTGTCGCCT 
AGGCCAGGAC 
CCGGTTCCGA 
TACCAGCCCA 
CACCACCAAG 
CCGATGACTT 
GCCACCACCG 
CGTCGACCCC 
GCCAGGAGCA 
TACAAGTCGT 
CGAGCAGGAC 
TGTCGCAGCC 
TTGGTGAGAT 



TCTCAGCACT 
CGAGGGTTTG 
CGGTCCCCGT 
GAGCCCATCG 
TCACGCCGAG 
TGGAGACCGG 
GGTAAGATTG 
CCAGGAGTTG 
TCACTGGTGT 
ATGAAGGAGA 
GGTGTTCGGT 
TGACCGGTTT 
GTGTTGTTGT 
GGTGTCGGCG 
CTTTGGCCAC 
AAGGGTTCCG 
GACTGACCCT 
TGTTGTCGCG 
TTGGACTCCA 
CTACGACGTC 
TGCAGGACAT 
AAGTTGACCG 
CTTCGCTGTC 
TGCAGGACAC 



TGGGTGAGAA 50 

GTGAGAGGTA 10 0 

TGGTCAGGGT 150 

ACGAGCGTGG 2 00 

CCCCCGTCTT 2 50 

TATCAAGGTT 3 00 

GTCTTTTCGG 3 5 0 

ATTAACAACA 4 00 

CGGTGAGCGT 450 

CCGGTGTCAT 50 0 

CAGATGAACG 550 

GACCATTGCC 60 0 

TCGTTGACAA 650 

TTGTTGGGTC 7 00 

CGACATGGGT 750 

TCACCTCTGT 8 00 

GCCCCCGCCA 850 

TGGTATCTCC 90 0 

AGTCGAGATT 950 

GCCACCGGTG 10 0 0 

CATTGCCATT 1050 

TCGAGAGAGC 1100 

GCCGAGGTTT 1150 

CG 1192 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 664 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664 
AAYATGAT I A CIGGIGCIGC ICARATGGA 



29 



55 



60 



2) INFORMATION FOR SEQ ID NO: 665 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 77 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: extracted from X00779 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665 

ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 
TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 
TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 
AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 200 
AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 250 
AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 3 00 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 
TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 4 00 

GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 450 
GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA GATTCCAAGA 500 
AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 
AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 
GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 
GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 700 
AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 750 
GTTTACAAGA TTGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 
CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTTGCCCCA GCTGGTGTTA 850 
CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 
GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 12 00 

AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

ACCATTAGGT AGATTCGCTG TCAGAGACAT GAGACAAACT GTCGCTGTCG 130 0 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 13 77 



2 ) INFORMATION FOR SEQ ID NO: 666 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1536 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(C) ACCESSION NUMBER: extracted from M12082 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666 

ATGGTTTTGC CAAGACTATA TACTGCTACA TCCCGTGCTG CTTTTAAAGC 50 

AGCCAAACAA TCCGCTCCGC TTCTATCCAC TTCGTGGAAA AGATGTATGG 100 

CCTCAGCTGC TCAATCTACT CCAATCACCG GTAAAGTTAC CGCTGTCATT 150 

GGTGCCATTG TTGACGTTCA TTTTGAACAA TCAGAGTTGC CCGCTATTTT 200 
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GAACGCTTTA GAAATTAAAA CACCTCAAGG TAAGTTGGTT TTGGAAGTTG 250 

CTCAACATTT GGGTGAAAAC ACTGTCAGAA CCATTGCTAT GGATGGTACC 3 00 

GAAGGTTTGG TCCGTGGTGA AAAGGTTCTT GACACTGGTG GCCCTATCTC 350 

CGTCCCAGTT GGGAGAGAAA CTTTAGGGAG AATCATCAAC GTTATCGGTG 4 00 

5 AACCTATTGA TGAAAGAGGT CCAATTAAGT CCAAACTAAG AAAGCCAATT 450 

CACGCAGACC CTCCTAGTTT TGCAGAACAA TCTACTTCGG CTGAAATTTT 50 0 

GGAAACAGGT ATCAAAGTCG TCGATCTATT AGCTCCTTAT GCCAGAGGTG 550 

GTAAGATTGG TCTTTTCGGT GGTGCAGGTG TCGGTAAGAC TGTGTTCATT 60 0 

CAAGAATTGA TTAACAATAT CGCCAAGGCC CATGGTGGTT TTTCCGTTTT 650 

10 CGCCGGTGTT GGTGAAAGGA CCAGAGAGGG TAATGACTTG TACCGTGAAA 700 

TGAAGGAAAC TGGAGTCATT AACTTGGAAG GTGAATCCAA GGTCGCCTTA 750 

GTTTTCGGTC AAATGAACGA ACCTCCAGGA GCCAGAGCCA GAGTCGCTTT 80 0 

AACTGGTTTG ACGATCGCTG AATATTTCAG AGATGAAGAA GGTCAAGACG 850 

TCTTGTTGTT TATCGACAAT ATCTTTAGAT TTACTCAAGC TGGTTCAGAA 90 0 

15 GTCTCTGCCC TTTTGGGTCG TATTCCATCT GCCGTCGGTT ATCAACCAAC 950 

TTTGGCCACT GATATGGGTC TCTTACAAGA AAGAATTACC ACCACAAAGA 1000 

AGGGTTCTGT CACTTCTGTG CAAGCCGTTT ATGTTCCAGC CGATGATTTA 1050 

ACAGATCCGT CTCCGTCCAC ATCTTTTGCC CATTTGGACG CATCATCCGT 1100 

CTTGTCAAGA GGTATTTCAG AATTAGGTAT TTACCCTGCA GTGGATCCAT 1150 

2 0 TGGATTCTAA ATCAAGGTTA TTGGATGCCG CCGTTGTCGG TCAAGAACAT 12 00 

TATGACGTCG CCTCCAAGGT TCAAGAAACT TTACAGACCT ATAAATCTTT 12 50 

ACAAGATATC ATTGCTATTT TGGGTATGGA TGAATTGTCC GAACAAGATA 13 00 

AACTAACTGT CGAAAGGGCA AGAAAGATTC AAAGATTCTT ATCTCAACCA 1350 

TTTGCTGTCG CCGAAGTCTT TACTGGTATC CCAGGTAAAT TAGTGAGATT 1400 

2 5 AAAGGACACC GTTGCCTCGT TCAAAGCCGT TTTGGAAGGT AAATACGATA 1450 

ATATACCAGA ACATGCTTTC TATATGGTTG GTGGTATTGA AGATGTTGTT 1500 

CGTAAAGCTG AAAAGTTAGC CCGTGAAGCC AACTAG 153 6 



30 



40 



2) INFORMATION FOR SEQ ID NO : 667 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 12 93 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: Y 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667 

CGGAAGGCGT GCCGCCGGTG TTAACTGCTT TGGATGTGGT GGAAAAACTT 50 

GGCCGTGACG AGCCCTTGAC ACTTGAAATT GTACAGCACT TGGACGCCCA 100 

CACCGGCCGT TGCATTGCGA TGCAGACGAC AGATCTGCTT AAGCTGAAAT 150 

50 CGAAGGTAGT CTCGAGTGGC GGGAACATTT CTGTGCCTGT GGGTCGGGAG 2 00 

ACACTTGGCA GAATCTTCAA TGTGCTGGGC GATGCCATTG ACCAGCGCGG 2 50 

CCATGTTGGA GAGAAGCAAC GCATGCCTAT TCACGCTGTG GCACCAAAGT 300 

TGGCGGATCA GGCGGCAGAG GACACCATTC TCACAACGGG TATCAAGGTG 350 

ATTGATCTGA TTTTGCCCTA CTGCAAGGGT GGGAAGATTG GTCTTTTTGG 4 00 

55 TGGTGCTGGC GTGGGCAAGA CGGTTATTAT CATGGAACTT ATTAACAACG 4 50 

TTGCCAAGGG CCATGGTGGG TTCTCCGTCT TTGCGGGTGT TGGTGAACGC 500 

ACGCGTGAGG GGACGGATCT TTACCTGGAG ATGATGCAAT CCAAAGTTAT 550 

TGACCTGAAG GGTGACTCGA AGTGTGTGTT GGTGTATGGT CAGATGAACG 600 

AGCCTCCGGG TGCC CGTGCT CGTGTGGCGC AGTCTGCCTT GACAATGGCC 650 

60 GAGTACTTCC GTGATGTGGA AGGGCAAGAC GTGTTGCTTT TTATCGACAA 700 

376 
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10 



WO 01/23604 

CATTTTCCGC 
GTATTCCCGC 
CAGTTGCAGG 
GCAGGCTGTG 
CGACCTTTTC 
GAATCCGGCA 
CATGGACCCG 
TGGTGCAGAT 
CTTGGCATTG 
GCGTAAGGTG 
TTACTGGCAT 
TTTTCCGGCC 



TTTACGCAGG 
TGCCGTCGGC 
AGCGCATTAC 
TACGTGCCAG 
CCACCTCGAT 
TTTACCCCGC 
GATGTAATCA 
GCTTACCAAG 
ATGAGCTCAG 
ACAAAGTTTC 
GACAGGCCAC 
TGTTGATGGG 



CAAACTCTGA 
TACCAGCCTA 
GTCCACGACA 
CCGATGATAT 
GCCACGACGG 
TGTCAACCCA 
GCGTTGACCA 
TACAAGGAGC 
TGAGGAGGAT 
TCTCCCAGCC 
TACGTGCAGC 
CACATATGAT 



GGTGTCAGCG 
CCCTTGCTGA 
AAAGGTTCCA 
TACCGACCCT 
TGCTGGACCG 
CTGGAGTGTG 
CTACAACGTG 
TGCAGGATAT 
AAACTTATCG 
TTTCCAGGTG 
TGGAGGAGAC 
CAGGTGCCGG 



PCT/CA00/01150 

CTOTTGGGTC 7J50 

GGATCTTGGG "8 00 

TTACCTCTGT 850 

GCGCCAGCAA 900 

TGCCGTTGCC 950 

CCTCGCGTAT 1000 

GCGCAGGATG 1050 

CATTGCGGTG 110 0 

TGGACCGTGC 1150 

GCGGAGGTGT 12 00 

CATTGAGTCT 12 50 

AGA 1293 
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2) INFORMATION FOR SEQ ID NO: 668 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1191 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium glutamicxim 
(C) ACCESSION NUMBER: X77034 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668 



GTGGCAAAGG 
CATCGGTCAC 
AGGTTCTGGC 
TCCATCGATA 
CTCCCACGTT 
CTCCAGGCCA 
ATGGACGGCG 
GACCCGTGAG 
TCGTTGCTCT 
CTCGTCGAGA 
AGAGGCTCCA 
AGAAGTGGGG 
ATCCCTGACC 
GGACATCTTC 
AGCGCGGTAC 
GAGAAGTCCA 
TCTTGACTCC 
TCAAGCGCGA 
TACACCCCTC 
TGAAGGTGGC 
ACTTCCGCAC 
GAGATGGTCA 
GCCTGTCGCT 
GCACCGTTGG 



CGAAGTTCGA 
GTTGACCACG 
TGACACTTAC 
AGGCTCCTGA 
GAGTACCAGA 
CGCCGACTAC 
CAATCCTCGT 
CACGTTCTTC 
TAACAAGTGC 
TGGAAGTTCG 
ATTGTTCACA 
CAAGCAGATC 
CAGTTCGTGA 
ACCATCACCG 
CCTGAACGTG 
CCTCCACCAC 
GCTGAGGCTG 
AGATGTTGAG 
ACACCGAGTT 
CGCCACACCC 
CACCGACGTT 
TGCCTGGCGA 
ATGGACGAGG 
CGCTGGTCGT 



GCGTACCAAG 
GTAAGACCAC 
CCTGAGCTCA 
GGAGAAGGAG 
CTGAAAAGCG 
ATCAAGAACA 
TGTTGCTGCT 
TTGCTCGCCA 
GACATGGTTG 
TGAACTTCTT 
TCTCCGCTCT 
CTTGAGCTCA 
GACCGACAAG 
GTCGTGGCAC 
AACGATGATG 
CGTTACCGGT 
GCGACAACTG 
CGTGGCCAGG 
CGAGGGCTCT 
CATTCTTCGA 
ACCGGTGTTG 
CAACGTCGAC 
GCCTGCGTTT 
GTCACCAAGA 



CCCCACGTAA 
CACCACCGCG 
ACGAGGCTTT 
CGTGGCATCA 
CCACTACGCA 
TGATTACCGG 
ACCGACGGCC 
GGTTGGCGTT 
AGGATGAGGA 
GCTGAGCAGG 
GAAGGCTCTT 
TGCAGGCTTG 
CCATTCCTCA 
CGTTGTTACC 
TTGACATCAT 
ATCGAGATGT 
TGGTCTGCTT 
TTATCGTTAA 
GTCTACGTTC 
CAACTACCGT 
TGAAGCTTCC 
ATGTCCGTCA 
CGCTATCCGC 
TCATCAAGTA 



ACATCGGCAC 50 

GCTATCACCA 100 

CGCCTTCGAC 150 

CGATCAACAT 2 00 

CACGTTGACG 2 50 

CGCTGCTCAG 3 00 

CAATGCCTCA 3 50 

CCTTACATCC 4 00 

AATCATCGAG 4 50 

ACTACGACGA 500 

GAGGGCGACG 550 

CGATGACAAC 600 

TGCCTATCGA 650 

GGTCGTGTTG 7 00 

CGGCATCAAG 750 

TCCGTAAGCT 8 00 

CTCCGTGGTA 850 

GCCAGGCGCT 900 

TGTCCAAGGA 950 

CCTCAGTTCT 1000 

AGAGGGCACC 1050 

CCCTGATCCA 1100 

GAAGGCTCCC 1150 

A 1191 



60 



2) INFORMATION FOR SEQ ID NO: 669 

377 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 83 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: extracted from V00267 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669 

15 ATGGCTACTG GAAAGATTGT CCAGGTAATC GGCGCCGTAG TTGACGTCGA 50 

ATTCCCTCAG GATGCCGTAC CGCGCGTGTA CGATGCTCTT GAGGTGCAAA 100 

ATGGTAATGA GCGTCTGGTG CTGGAAGTTC AGCAGCAGCT CGGCGGCGGT 150 

ATCGTACGTA CCATCGCAAT GGGTTCCTCC GACGGTCTGC GTCGCGGTCT 2 00 

GGATGTAAAA GACCTCGAAC ACCCGATTGA AGTCCCGGTA GGTAAAGCGA 2 50 

2 0 CTCTGGGCCG TATCATGAAC GTACTGGGTG AACCGGTCGA CATGAAAGGC 300 

GAGATCGGTG AAGAAGAGCG TTGGGCGATT CACCGCGCAG CACCTTCCTA 350 

CGAAGAGCTG TCAAACTCTC AGGAACTGCT GGAAACCGGT ATCAAAGTTA 4 00 

TCGACCTGAT GTGTCCGTTC GCTAAGGGCG GTAAAGTTGG TCTGTTCGGT 4 50 

GGTGCGGGTG TAGGTAAAAC CGTAAACATG ATGGAGCTCA TTCGTAACAT 500 

2 5 CGCGATCGAG CACTCCGGTT ACTCTGTGTT TGCGGGCGTA GGTGAACGTA 550 

CTCGTGAGGG TAACGACTTC TACCACGAAA TGACCGACTC CAACGTTATC 600 

GACAAAGTAT CCCTGGTGTA TGGCCAGATG AACGAGCCGC CGGGAAACCG 650 

TCTGCGCGTT GCTCTGACCG GTCTGACCAT GGCTGAGAAA TTCCGTGACG 700 

AAGGTCGTGA CGTTCTGCTG TTCGTTGACA ACATCTATCG TTACACCCTG 750 

30 GCCGGTACGG AAGTATCCGC ACTGCTGGGC CGTATGCCTT CAGCGGTAGG 800 

TTATCAGCCG ACCCTGGCGG AAGAGATGGG CGTTCTGCAG GAACGTATCA 850 

CCTCCACCAA AACTGGTTCT ATCACCTCCG TACAGGCAGT ATACGTACCT 9 00 

GCGGATGACT TGACTGACCC GTCTCCGGCA ACCACCTTTG CGCACCTTGA 950 

CGCAACCGTG GTACTGAGCC GTCAGATCGC GTCTCTGGGT ATCTACCCGG 1000 

35 CCGTTGACCC GCTGGACTCC ACCAGCCGTC AGCTGGACCC GCTGGTGGTT 1050 

GGTCAGGAAC ACTACGACAC CGCGCGTGGC GTTCAGTCCA TCCTGCAACG 1100 

TTATCAGGAA CTGAAAGACA TCATCGCCAT CCTGGGTATG GATGAACTGT 1150 

CTGAAGAAGA CAAACTGGTG GTAGCGCGTG CTCGTAAGAT CCAGCGCTTC 1200 

CTGTCCCAGC CGTTCTTCGT GGCAGAAGTA TTCACCGGTT CTCCGGGTAA 1250 

4 0 ATACGTCTCC CTGAAAGACA CCATCCGTGG CTTTAAAGGC ATCATGGAAG 13 00 

GCGAATACGA TCACCTGCCG GAGCAGGCGT TCTACATGGT CGGTTCCATC 13 50 

GAAGAAGCTG TGGAAAAAGC CAAAAAACTT TAA 13 83 



45 



55 



60 



2) INFORMATION FOR SEQ ID NO: 670 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1410 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Helicobacter pylori 

(B) STRAIN: NCTC 11638 

(C) ACCESSION NUMBER: extracted from AF004014 

378 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
(xi) SEQUENCE DESCRIPTION: SEQ 

ATGAAAGCGA TGGAAGGTAA AATCATTCAG 
TGTGGAGTTT GAATCCTATC TGCCGGCGAT 
ATTTTGAAGT TAATGGCGTT CAAAAATCTT 
CATTTGGGTG GTAATCGGGT GCGAGCGATT 
CTTAGTGCGT AACCAAGCCG TCAAAGCTCG 
CTGTGGGCGA AGAAGTGTTA GGGCGTATTT 
ATTGATAATT TAGAGCCGCT TAAGCCGTCC 
AAAAGCCCCT AGTTTTGAGC AGCAAAGCAC 
CCGGTATTAA AGTCATTGAC TTGCTCGCGC 
GTAGGCTTGT TTGGTGGGGC TGGCGTAGGC 
GCTTATCCAC AATGTGGCTT ATAAGCATAA 
GTGTGGGGGA GCGCACCAGA GAAGGGAACG 
GAAGGGGGCG TTTTAGACAA AGTTGCGTTG 
GCCACCAGGT GCAAGGAATC GCATCGCATT 
AGTATTTCCG TGATGAAAAG GGCTTAGATG 
ATCTTTAGAT ACGCTCAAAG CGGTGCGGAA 
TATCCCTTCA GCGGTGGGGT ATCAGCCCAC 
AACTTCAAGA GCGTATCGCT TCCACTAAAA 
CAAGCGGTGT ATGTGCCAGC AGACGACTTG 
GGTGTTTGCG CATTTAGATG CGACTACGGT 
AAAAAGGGAT TTATCCGGCG GTGGATCCTT 
TTAAGCCCTC AAATGATTGG CGAGAAGCAC 
CCAGCAAGTT TTGCAAAAAT ACAAGGATTT 
TGGGATTGGA CGAATTGAGC GAAGAGGATA 
AGAAAAATTG AGAAGTTTTT ATCCCAGCCG 
TACAGGAAGT CCCGGTAAGT ATGTGACTCT 
TTGGAGGGAT TTTAGAGGGC AAATACGATC 
TACATGGTGG GCAGCATTCA AGAGGTTTTA 
AAATTCCTAA 



PCT/CA00/01150 

ID NO: 670 

GTTTTAGGCC CGGTGGTAGA 50 

TTTTGAAGCA CTAGACATTA 100 

TAGTTTTAGA GGTGGCAGCC 150 

GCTATGGATA TGACAGAAGG 200 

TGGCAAAATG ATTGAAGTGC 2 50 

TTAATGTTGT GGGCGAGAGC 3 00 

TTAACTTGGC CCATTCACAG 3 50 

TAAAACAGAA ATGTTTGAAA 4 00 

CTTATTCTAA GGGCGGTAAA 450 

AAAACGGTGA TCATTATGGA 500 

CGGGTATTCG GTGTTTGCAG 550 

ATCTGTATTT TGAGATGAAA 600 

TGCTATGGGC AAATGAATGA 650 

CACCGGCTTG ACGATGGCGG 700 

TGTTGATGTT TATTGATAAC 750 

ATGAGCGCGC TATTAGGCCG 800 

GCTAGCCGGG GAAATGGGGA 850 

ATGGCTCTAT CACTTCGGTT 900 

ACTGACCCAG CCCCTGCTTC 950 

GTTGAATAGA AAGATCGCTG 1000 

TGGATTCCAC TTCAAGGATT 1050 

TATGAAATCG CCACCGGTAT 1100 

GCAAGATATT ATTGCGATTT 1150 

AAAAAACGGT TGAAAGGGCC 12 00 

TTTTTTGTGG CTGAAGTGTT 1250 

CCAAGAGACT TTAGAGGGCT 13 00 

ACATTCCTGA AAACGCGTTT 1350 

GAAAAAGCTA AAAACATGAA 14 00 

1410 



2) INFORMATION FOR SEQ ID NO: 671 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 01 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium acetobutylicum 

(B) STRAIN: DSM 792 

(C) ACCESSION NUMBER: extracted from AF101055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671 

ATGCCAGAAC ATGTAGGTAA AATTGTTCAG GTAATAGGAC CTGTTGTGGA 50 

TATTAAATTT GATGCAGAGA ACCTTCCTGA CATCTATAAT TCCATAGAAA 10 0 

TAGATATGGG AGATAATAAA AAACTCATTG CTGAAGTTGA ACAACATGTA 150 

GGAGATGACA TAGTAAGAAC AATAGCAATG GAAGGTACTG ACGGATTAAA 200 

AAGAGGAATG GAAGCAGTTA ACACTGGTAA ACCAATATCT GTACCAGTTG 2 50 

GAGAAAATGT TTTAGGACGT CTTTTTAATG TTTTAGGTCA GACAATAGAT 3 00 

GAAGCAGGAG ACATGAATGC TGATAAGTAT TATCCAATTC ATAGACCAGC 3 50 

TCCAACCTTT GAAGAACAAT CAGTTCAACC AGAAATGTTT GAAACAGGTA 4 00 

TTAAGGTTAT AGATTTACTT GCTCCATATC AAAAGGGTGG AAAAATCGGT 450 

379 
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TTGTTCGGTG GTGCCGGTGT 
AAATAATATA GCAAAAGAAC 
GAGAAAGAAC AAGAGAAGGT 
GGAGTTATAA ATAAAACAGC 
5 TGGCGCTAGA ATGAGAGTTG 
TTAGAGACAA AGGTCAAGAT 
TTTACACAAG CTGGTTCAGA 
TGCCGTTGGT TATCAGCCAA 
AGAGAATAAC ATCAACAAAA 

10 TATGTTCCTG CTGATGACCT 
GCATCTTGAT GCAACAACAG 
TATATCCTGC AGTTAGTCCT 
AGAATTGTTG GAGAAGAGCA 
ACTTGAAAGA TATCAAGAAC 

15 ATGAACTTTC AGATGAGGAT 
CAGAGATTCT TATCTCAAGC 
GAAAGGTCAG TTTGTACCTG 
TATTAGATGG TAAGTGTGAT 
GGAACAATAG AAGATGTAAA 

20 A 



TGGTAAAACA GTTCTTATTC 
ACGGTGGATT ATCAGTATTC 
AATGACCTTT ATTATGAAAT 
TCTAGTATTT GGTCAGATGA 
CTTTAACAGG ACTTACAATG 
GTGCTTCTAT TTATAGATAA 
GGTTTCAGCT TTACTTGGTA 
CACTTGCAAA TGAAATGGGT 
CAGGGTTCAA TCACATCCGT 
TACAGACCCA GCTCCAGCAA 
TTCTTTCAAG AGAAATATCA 
CTTGAATCAA CTTCAAGAAT 
TTATGAAGTT GCTAACAAGG 
TTCAAGATAT CATAGCTATA 
AGATTGTTAG TTGGAAGAGC 
TTTTAGTGTT GCTGAACAAT 
TAAAAGATAC TATAAGAAGT 
GATCTTCCAG AAGCTGCATT 
AGAAAAAGCT AAAAAAATGA 



PCT/CA00/01150 

AGO^ACTTAT ioo 

ACAGGTGTTG 550 

GAAAGATTCA 600 

ATGAACCACC 650 

GCTGAATATT 70 0 

TATATTCAGA 750 

GAATACCTAG 800 

GCTCTTCAAG 850 

TCAGGCTGTA 900 

CAACATTTAC 950 

AACTTAGGAA 1000 

ACTTGATCCA 1050 

TTAAACATAT 110 0 

CTTGGTGTTG 1150 

AAGAAGAGTA 12 00 

TTACAGGAAT 12 50 

TTTAAAGAAA 13 00 

TTTATTTGCA 13 50 

TGGAAAGCTA 14 00 
1401 



2 ) INFORMATION FOR SEQ ID NO: 672 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Cytophaga lytica 

(B) STRAIN: DSM 2039 

(C) ACCESSION NUMBER: M22 535 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672 



ATGTCTAAAG TTACAGGTAA AGTTTCCCAA ATTATTGGCC CAGTTATAGA 50 

TGTGGAGTTC CAAGCAGGGG TAGATCTTCC AAAAATTTAT GATTCATTAG 100 

AAATTAAAAA AGCAGATGGA TCAATTTTGG TTTTGGAAGT ACAATCACAC 150 

ATTGGTGAGA ACACAGTAAG AACTATATCT ATGGATTCTT CTGATGGTTT 2 00 

45 AAGTCGTGGA GCAGAGGTTA ATGCAACAGG AAGCGCTATA CAAATGCCAG 2 50 

TTGGAGATGA CGTTTACGGA CGTTTATTTA ACGTAATTGG AGACGCTATT 300 

GATGGTCTTG GGAATTTACC AAAATCTGGT AAAGATGGTT TGCCAATACA 3 50 

CAGAGAGGCA CCAAAATTTG AAGACTTATC TACTTCTACA GAAGTATTAT 4 00 

TTACAGGTAT TAAAGTAATT GACCTTATTG AGCCTTATGC AAAAGGTGGT 450 

50 AAGATTGGTT TATTTGGAGG TGCCGGAGTA GGTAAAACAG TATTAATTCA 500 

GGAATTAATT AACAACATTG CAAAAGGTCA CGGTGGACTT TCTGTATTTG 550 

CTGGTGTAGG TGAGCGTACT CGTGAGGGTA ACGATTTACT ACGTGAGATG 600 

TTAGAGTCTG GTATTATTAA ATACGGAGAT GACTTTATGC ACTCTATGGA 650 

AGAAGGTGGT TGGGATTTAT CTAAAGTTGA TAAATCTGTA ATGAAAGATT 700 

55 CTAAAGCAAC CTTTGTATTT GGACAAATGA ATGAGCCACC AGGAGCACGT 750 

GCACGTGTTG CATTATCTGG TTTAACTATT GCAGAATATT TCCGTGATGG 8 00 

AGCAGGTGAA GGTCAAGGTA AAGATGTACT TTTCTTTGTG GATAACATTT 850 

TCCGTTTTAC ACAAGCTGGT TCTGAGGTAT CTGCATTACT TGGTCGTATG 900 

CCATCTGCGG TAGGTTACCA ACCAACATTA GCAACAGAAA TGGGTGCTAT 950 

60 GCAAGAGCGT ATTACATCAA CAAAAAGAGG TTCTATTACA TCTGTACAGG 1000 
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CGGTTTACGT ACCTGCGGAT GATTTAACGG 

TTTGCTCACT TAGATGCAAC AACGGTATTG 

AGGTATTTAC CCAGCGGTAG ATCCATTAGA 

CTCCAGAAAT TTTAGGAAAA GATCACTACT 

5 GAGTTGTTAC AACGTTATAA AGAATTACAA 

TATGGAAGAA TTATCTGAGG AAGATAAAAT 

GTGTACAACG TTTCTTATCT CAGCCTTTCC 

GGTCTTAAAG GTGTTTTAGT AG AT AT C AAG 

TATGATTATG GATGGTGAAT TAGATCACTT 

10 TTAAAGGTAC TATTGAAGAA GCTATAGAAG 
GAAGCATAA 



PCT/CA00/0115O 

ATCCAGCACC GGCAACTACC 1^5 0 

TCTCGTAAAA TTGCAGAGTT 1100 

TTCTACTTCT AGAATCTTAG 1150 

CTTGTGCACA ACGTGTAAAA 12 00 

GATATTATTG CTATCCTTGG 125 0 

GGCAGTTGGT AGAGCAAGAC 1300 

ACGTAGCAGA GCAATTTACA 13 50 

GATACTATTA AAGGATTTAA 14 0 0 

ACCAGAATCT GCATTTAACC 14 50 

CAGGAGAAAA AATGCTTGCT 150 0 

1509 



15 2 ) INFORMATION FOR SEQ ID NO: 673 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ehrlichia risticii 

(B) STRAIN: HRC-IL 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673 



CCTAAAATAT ATGATGCATT ATATGTAAAA CTAGATAATG AAAATTTGTG 50 

TTTAGAAGTT TCACAAATTA TTGGAGATAA TGTTGTTAGA TGTATTGCAA 10 0 

TGGGAGCTAC TTATGGATTA AATCGTGGTT TAGAAGTAGT TTGTTCAGGA 15 0 

AATCCAATTC AGGTTCCTGT AGGTGAACAA GTTTTAGGTA GAATGTTTAA 2 00 

35 TGTTGTTGGT AAAACAATTG ACAATCTTGA ATCTTTAGAT GATAAAAATA 250 

TAAAAATGAT GCCAATTCAT CGAAATCCAC CATCATTTGA AGAGCAATCC 3 00 

AATGAAATTG AAATTTTTGA AACAGGCATT AAAGTTATTG ATTTATTAAT 350 

TCCATATGCT AAAGGTGGTA AGATTGGATT ATTTGGTGGA GCAGGGGTTG 4 00 

GGAAAACGGT TCTTGTTCAA GAATTAATTC ACAATATCGC AAAAGGTCAT 450 

40 GGTGGTCTAT CTGTTTTTGC TGGAGTTGGT GAAAGAACTC GTGAAGGTAA 500 

TGACTTGTAT TATGAAATGA TTGAAGGTGG AGTTATAGAT AAAACAGCCT 550 

TAGTGTTTGG GCAAATGAAT GAACCTCCTG GCGCAAGAAT GCGCGTAGCA 60 0 

TTAACTGCTT TAACAATGGC TGAATATTTC CGTGATGTTC AAAACCAAGA 650 

TGTTTTGTTA TTTATTGATA ATATCTTTAG ATTTACACAA GCTGGTAGTG 700 

45 AAGTTTCAGC ATTATTAGGA AGAATGCCAT CTGCTGTTGG TTATCAACCA 750 

ACTTTGGCAT ATGAAATGGG ATTGTTACAA GAAAGAATCA CTTCCACTAA 800 

AAGTGGTTCT ATAACATCT 819 



50 



60 



2) INFORMATION FOR SEQ ID NO: 674 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 840 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

381 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 25870 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674 



AGAGCGAAGT 
GAACGTCTGG 
CGCTATCGTT 
AAAACACTGG 
CGTATCATGA 
CGCAGAAGAG 
AGTCTAGTGC 
ATCTGTCCGT 
TGTAGGTAAG 
AGCACTCAGG 
GGTAACGACT 
ACAACCAGAA 
CACCAGGCAA 
AAGTTCCGTG 
CCGTTACACC 
CTTCAGCGGT 
CAAGAACGTA 



ACCAAGTGTT 
TTCTGGAAGT 
ATGGGTAGCT 
CGCTCCAATT 
ACGTGCTTGG 
GTGTACTCGA 
AACTGAACTT 
TTGCGAAAGG 
ACCGTTAACA 
TTTGTCAGTA 
TCTACCACGA 
CTGTCGAAAG 
CCGTCTGCGT 
ATGAAGGCCG 
CTAGCGGGAA 
AGGTTACCAA 
TCACGTCAAC 



TACGATGCTC 
TCAACAGCAG 
CGGATGGTTT 
TCAGTACCAG 
TGATGCGATT 
TTCACCGTCC 
TTGGAAACGG 
CGGTAAAATC 
TGATGGAACT 
TTTGCTGGGG 
AATGCAGGAA 
TAGCGATGGT 
GTAGCACTGA 
TGACGTACTG 
CGGAAGTATC 
CCAACACTGG 
CAAAAAAGGT 



TGAATGTTGT GGATTCCAAA 
CTAGGCGGTG GCGTGATTCG 
ACGTCGTGGA ATGACAGTAC 
TAGGTACTAA AACCCTAGGT 
GACGAACGTG GCGACATTGG 
TGCTCCAAGC TACGAAGAAC 
GTGTTAAGGT TATCGACCTG 
GGTCTGTTCG GTGGTGCGGG 
TATCAACAAC ATCGCGCTAC 
TAGGTGAGCG TACTCGTGAG 
GCGGGCGTTG TAAACGTTGA 
TTACGGTCAG ATGAACGAGC 
CTGGTCTGAC TATGGCGGAA 
CTGTTTATCG ACAACATCTA 
TGCTCTGCTT GGCCGTATGC 
CTGAAGAGAT GGGTGTTCTG 
TCTATCACCT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
840 



2) INFORMATION FOR SEQ ID NO: 675 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio cholerae 

(B) STRAIN: ATCC 25870 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675 

CGGCGGTATC CTAGTTGTAG CGGCAACTGA CGGTCCAATG CCACAAACTC 50 

GTGAGCACAT CCTGCTGGGT CGCCAAGTAG GTATTCCTTA CATCATCGTA 100 

TTCATGAACA AGTGTGACAT GGTTGACGAT GAAGAGCTTC TAGAGCTGGT 150 

AGAGATGGAA GTTCGTGAGC TGCTGTCTGA GTACGATTTC CCAGGTGATG 2 00 

ACCTGCCAGT AATCCAAGGT TCAGCACTAG GCGCGCTAAA CGGCGAAGCA 2 50 

CAGTGGGAAG CGAAGATTGT TGAGCTAGCA GAAGCACTGG ATACTTAYAT 3 00 

TCCAGAGCCA GAGCGTGCAG TAGACATGGC ATTCCTGATG CCAATCGAAG 350 

ACGTATTCTC AATCCAAGGT CGTGGTACAG TAGTAACTGG CCGTATCGAG 4 00 

CGCGGCATCC TGAAAGTGGG TGACGAAGTA GCGATCGTTG GTATCAAAGA 4 50 

GACAGTAAAA ACGACCTGTA CAGGTGTAGA GATGTTCCGT AAGCTGCTTG 500 

ACGAAGGTCG TGCAGGTGAG AACGTAGGTG CACTRCTACG TGGTACTAAG 550 

CGTGAAGAAG TAGAGCGTGG TCAAGTACTG GCGAAGCCAG GTTCAATCAC 600 

ACCACACACT AAGTTCGAAT CAGAAGTATA CGTACTGTCA AAAGATGAAG 650 

GTGGCCGTCA TACTCCATTC TTCAAAGGTT ACCGTCCACA GTTCTACTTC 700 

CGTACAACTG ACGTAACAGG CAGCATTGAG CTACCAGARG GCGTAGAAAT 750 

GGTAATGCCA GGCGACAACG TGAAGATGGT TGTAGACCTG ATTGCACCAA 800 

TCGCGATGGA CGAAGGTCTA CGCTTCGC 828 
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2) INFORMATION FOR SEQ ID NO: 676 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania enriettii 
15 (B) STRAIN: ATCC 50120 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676 

CTTCTCGGAT GGCGTGCCGC CGGTGCTGAC GGCGCTGGAC GTGACGGAGG 50 

2 0 AACTCGGGCG CGACGAGCCG CTGACGCTAG AGATCGTGCA ACACTTGGAC 100 

GCGCACACCG GCCGCTGCAT TGCGATGCAG ACGACGGACC TACTGAAGCT 150 

GAAGTCGAAG GTCGTGTCGA CCGGCGGGAA CATCTCTGTG CCTGTGGGCC 200 

GCGAGACGCT TGGCCGCATC TTCAACGTAC TGGGCGACGC GATTGACCAG 250 

CGCGGCCCCG TGGGCGAGAA GATGCGCATG GCGATCCACG CGGAGGCGCC 3 00 

2 5 GAAGCTGGCG GACCAGGCGG CGGAGGACAC GATCCTGACG ACCGGCATCA 3 50 

AAGTGATCGA TCTGATCCTG CCGTACTGCA AGGGCGGCAA GATCGGTCTG 4 00 

TTCGGCGGTG CCGGTGTGGG GAAGACTGTG ATCATCATGG AGCTGATCAA 4 50 

CAACGTCGCG AAGGGCCACG GTGGCTTCTC CGTGTTCGCC GGCGTTGGGG 500 

AGCGCACCCG CGAGGGGACG GATCTGTACC TGGAGATGAT GCAATCGAAG 550 

3 0 GTGATCGACC TGAAAGGTGA GTCGAAGTGT GTGCTTGTGT ACGGGCAGAT 600 

GAACGAGCCC CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA 650 

TGGCCGAGTA CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC 700 

GACAACATCT TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CAGCGCTGCT 750 

GGGCCGCATC CCCGCCGCTG TGGGCTACCA ACCGACGCTT GCGGAGGATC 8 00 

35 TCGGCATGTT GCAGGAGCGC ATCACGTCGA CGACGAAGGG GTCGATCACG 850 

TCTGTGCAGG CGGTGTACGT GCCTGCGGAT GATATCACGG ATCCCGCGCC 900 

GGCGACGACA TTCTCGCACC TGGACGCGAC GACGGTGCTG GACCGCGCGG 950 

TGGCGGAGTC TGGCATCTAC CCCGCGGTCA ACCCACTGGA GTGCGCGTCG 10 00 

CGTATCATGG ACCCTGATGT GATCGACGTG GATCACTACA ACGTTGCGCA 1050 

4 0 GGACATTGTT CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG 1100 

CGGTGCTTGG TATCGACGAG CTGAGCGAGG AGGACAAGGT TGTGGTGGAC 1150 

CGTGCGCGCA AGGTGACCCG GTTCCTGTCA CAGCCGTTCC AGGTTGCGGA 12 00 

GGTGTTTACT GGCATGACAG GCCATTACGT ACAGCTGAGC GACACGGTGG 12 50 

AGTCGTTCTC CGGTTTGCTG ATGGGGTCGT ACGACCAGAT TCCGGAGA 12 98 

45 



2) INFORMATION FOR SEQ ID NO: 677 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia micro tti 
60 (B) STRAIN: Persing-1 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677 

CAAGCTCAAG TCTGAGCGTG AGAGAGGTAT TACTATTGAC ATTACTCTCT 50 

5 GGAAATTTGA GACCCAGAAA TACGAGTACA CTGTCATAGA CGCACCTGGT 100 

CATCGTGACT TTATCAAAAA TATGATTACT GGGACTTCAC AAGCCGACGT 150 

TGCTATGCTT GTCGTTCCTG CTGAATCTGG CGGATTCGAG GCTGCTTTTT 2 00 

CCAAAGAAGG TCAGACCCGT GAACACGCCT TACTAGCCTT CACACTTGGC 2 50 

GTCAAACAGA TGATTGTTGC TATTAACAAA ATGGATTCTT GTCAGTACAA 300 

10 GGAGGATCGT TATATGGAAA TTTTCAAGGA AGTACAGCAG TACTTGAAGA 350 

AGGTGGGTTA CAAAGTTGAA AGCGTGCCGT TTGTTGCTAT TTCAGGATTC 4 00 

CACGGTGACA ACATGGTTGA AAAATCTACT AACATGCCTT GGTATAAGGG 4 50 

TAAGACCCTC GTAGAGGCAC TTGATCAAAT GGAGCCTCCA AAACGTCCGG 500 

TCGAAAAACC TCTTAGATTG CCCCTGCAGT GAGTCTATAA AATTGGAGGT 550 

15 ATTGGTACGG TACCAGTCGG AAGGGTCGAA ACAGGACAAC TGAAAGCAGG 600 

AATGATCATT ACTTTTGCCC CCACTGGTTT GACCACTGAA TGTAAATCTG 650 

TTGAAATGCA TCACGAGGTT GTGGAAGTGG CTAGCCCCGG TGATAACGTT 700 

GGATTTAATG TCAAGAATGT GTCTGTTAAG GATATTAAGA GAGGAAATGT 750 

GGCTTCGGAT TCGAAAAATG ACCCAGCCAA GGAAGCTACC TCTTTCTCTG 8 00 

20 CACAAGTCAT TGTACTCAAT CACCCTGGTA CCATCAAGGC CGGTTACTCA 850 

CCTGTGGTTG ATTGCCATAC TGCCCACATT GCTTGCAAAT TCGAATCTCT 900 

AGACACTAGG ATTGACAAGC GTACTGGCAA GACTTTGGAA GAAAATCCTA 950 

AGACTATTAA GAATGGTGAC GCTGCCATGG TGACTATGAA ACCAAATAAA 10 00 

CCCATGGTTG TGGAAACTTT CACCGACTAC GCCCCGTTGG GCCGGTTCGC 1050 

25 CGTGCGTGAT ATGCGCCAAA CCGTTGCCGT CGG 1083 



2 ) INFORMATION FOR SEQ ID NO: 678 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
4 0 (A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: Lev-12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 678 

45 TGTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCT CACGGTGGTT 50 

ACTCTGTCTT CACTGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 100 

TACCACGAAA TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150 

GGTCGCTCTT GGTGAGTTCT TTTTTTTCTT CAGGCTAATT AGTCGATGAC 2 00 

GTGGGCCCTG ACTAAAACTG TTTCTTCCAG TCTTCGGTCA GATGAACGAG 250 

50 CCCCCTGGAG CCCGTGCCCG AGTTGCCCTT ACTGGTTTGA CCATTGCCGA 3 00 

GTACTTCCGT GACGAGGAAG GCCAGGATGT GTTGCTTTTC ATTGACAACA 3 50 

TTTTCCGATT CACCCAGGCC GGTTCCGAGG TGTCTGCCTT GCTCGGTCGT 4 00 

ATCCCCTCTG CCGTCGGTTA CCAGCCCACT CTTTCCACCG ACATGGGTGG 4 50 

TATGCAGGAG CGAATGTAGG TTGCATTCTC TGTGATTTTA CGGCAAGCCT 500 

55 TGACTTTTTT TTTCTAGTAC CACCACCAAG AAGGGTTCCA TTACCTCCGT 550 

C 551 



60 2) INFORMATION FOR SEQ ID NO: 679 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 552 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ciryptococcus neoformans 

(B) STRAIN: ATCC 44104 



15 



30 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 679 



TGTGCTCATT CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 50 

ACTCCGTCTT CACCGGTGTC GGTGAGCGAA CTCGTGAGGG TAACGACTTG 10 0 

TACCACGAAA TGAGGGAGAC TGGTGTTATC AACCTTGAGG GTGACTCCAA 150 

GGTCGCTCTT GGTGAGTTCT TTTTTTTGGG AGCTAATTAG TCGATGACGT 2 00 

2 0 GGGCCCTGAC CAAAACCGTT TCTTTCAGTC TTCGGTCAGA TGAACGAGCC 250 

CCCTGGAGCC CGTGCCCGAG TTGCTCTTAC TGGTTTGACC ATTGCCGAGT 300 

ACTTCCGTGA CGAGGAAGGC CAGGATGTGT TGCTTTTCAT TGACAACATT 350 

TTCCGATTCA CCCAGGCCGG TTCCGAGGTG TCTGCCTTGC TCGGTCGTAT 4 00 

CCCCTCTGCC GTCGGTTACC AGCCCACTCT TTCCACCGAT ATGGGTGGTA 450 

2 5 TGCAAGAGCG AATGTAAGTT GCATTTTTTG TGATTTTACG GCAAGTCTTG 500 

ACTTACATTT TTTTCTAGCA CCACCACCAA GAAGGGTTCC ATTACCTCCG 550 
TC 



2) INFORMATION FOR SEQ ID NO : 68 0 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1018 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



552 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cunninghamella bertholletiae 

(B) STRAIN: ATCC 42115 

4 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 680 

TCTCCCTGCT ATTTTAAACG CTCTTGAAGT TAAGGATCAT GCTGGTGGTC 50 

GTCTTGTTCT TGAAGTTGCT CAACACTTGG GTGAAAACAC TGTACGTACT 100 

ATTGCTATGG ATGGTACTGA AGGTAAGTTT ATTTTTAGAT GATCATAAAT 150 

50 AATTGATCAT AATGATAAAA AAAAAAGAAG AAGAAGAACA GGATGTATAT 2 00 

AATGGTTAAT AAATAATATT TTCATATTGK ATATAACTAT TTAATCTGTT 2 50 

TTTTTTTCTT CATGATTATA TATATATATG TMCTAATATC TAATATGAAC 300 

CTTTTTTATA AAATTAATCA GGTCTTGTCC GTGGTCAAAA GGTTGTTGAT 350 

ACTGGTGCTC CTATTACCAT TCCTGTTGGT AAGGAAGTTC TTGGTCGTAT 4 00 

55 CATCAACGTT ATTGGTGAAC CCATTGATGA ACGTGGTCCT ATTAAGGCCA 450 

AGTCTCACCG TGCTATTCAC GCTGAAGCTC CAGAATTCGT TGATCAATCT 500 

CCTACTCCCG AAATTCTTGA AACTGGTATT AAGGTTGTCG ATTTATTAGC 550 

TCCTTATGCT CGTGGTGGTA AGATTGGTCT TTTCGGTGGT GCTGGTGTAG 600 

GTAAAACTGT CTTGATTCAA GAACTTATTA ACAACATTGC TAAAGCCCAT 650 

60 GGTGGTTACT CTATTTTCTG TGGTGTTGGT GAACGTACTC GTGAAGGTAA 700 

385 
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CGATTTATAC CACGAAATGA TGGAAACTGG TGTCATTAAA CTTGAAGGTG 750 

ACTCCAAGTG TGCTCTTGTA TTCGGTCAAA TGAACGAACC TCCTGGTGCT 80 0 

CGTGCCCGTG TTGCTTTAAC TGGTTTAACC ATTGCTGAAT ACTTCCGTGA 850 

TGAAGAAGGT CAAGATGTGT TACTTTTCAT TGATAACATT TTCCGTTTCA 900 

CTCAAGCTGG TTCTGAAGTA TCTGCCCTTT TAGGTCGTAT TCCATCTGCT 950 

GTAGGTTACC AACCCACTTT ATCTACTGAT ATGGGTGGTA TGCAAGAACG 1000 

TATTACTACT ACCAAGAA 1018 



2 ) INFORMATION FOR SEQ ID NO: 681 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681 
GGISSITTYG GIISIGGIAA RAC 23 

2 ) INFORMATION FOR SEQ ID NO: 682 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682 
GTIACIGGYT CYTCRAARTT ICCI CC 2 6 

40 

2 ) INFORMATION FOR SEQ ID NO: 683 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683 
55 GTIACIGGIT CISWIAWRTC ICCICC 26 



60 



2) INFORMATION FOR SEQ ID NO: 684 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 67 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Candida tropicalis 

(C) ACCESSION NUMBER: M64984 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684 

15 ATGGCTGGTG CTTTAGAAAA CGCAAGAAAA GAAATTAAAC GTCTTTCATT 50 
AGATGACACC AATGAATCCC AATATGGTCA AATCTATTCT GTTTCCGGTC 100 
CGGTTGTTAT TGCCGAAAAC ATGATTGGAT GTGCCATGTA CGAATTGGTT 150 
AAAGTTGGTC ATGATAATTT AGTTGGGGAA GTTATTAGAA TTAATGGTGA 2 00 

TAAAGCAACC ATTCAAGTTT ATGAAGAAAC TGCAGGGGTC ACTGTTGGTG 25 0 

2 0 ATCCAGTTTT AAGAACTGGT AAACCATTAT CTGTTGAATT AGGTCCTGGT 300 

TTAATGGAAA CTATTTATGA TGGTATTCAA AGACCTTTAA AAGCCATTAA 350 
AGATGAATCC CAATCTATTT ATATCCCAAG AGGTATTGAT GTTCCTGCTT 4 00 

TATCAAGAAC TGTTCAATAT GATTTCACTC CAGGTCAATT GAAAGTTGGT 4 50 

GATCATATCA CTGGTGGGGA CATTTTTGGT TCTATTTATG AAAACTCTTT 500 

25 ATTGGATGAC CATAAGATTT TGTTACCTCC AAGAGCAAGA GGTACTATTA 550 
CTTCTATTGC TGAAGCCGGT TCTTATAATG TTGAAGAACC AGTTTTGGAA 600 
GTTGAATTTG ATGGTAAGAA ACATAAATAC TCTATGATGC ATACATGGCC 650 
AGTTAGAGTT CCAAGACCAG TTGCTGAAAA ATTGACTGCT GATCATCCAT 700 
TGTTGACCGG TCAAAGAGTC TTGGATTCTT TATTCCCATG TGTTCAAGGT 750 

30 GGTACTACTT GTATCCCAGG GGCTTTTGGT TGTGGTAAAA CTGTTATTTC 800 
TCAATCTTTG TCCAAATTCT CCAACTCTGA TGTTATTATC TATGTTGGTT 850 
GTTTCACTAA AGGTACTCAA GTCATGATGG CTGATGGTGC CGACAAATCT 900 
ATTGAATCTA TTGAAGTTGG TGACAAAGTC ATGGGTAAAG ATGGTATGCC 950 

AAGAGAAGTT GTTGGCTTAC CAAGAGGTTA TGATGATATG TACAAGGTTC 1000 

3 5 GTCAACTTTC TAGTACTAGA CGTAATGCTA AATCCGAAGG CTTGATGGAT 1050 

TTCACTGTTT CTGCTGATCA TAAACTTATC TTGAAAACTA AACAAGATGT 1100 

CAAGATTGCT ACACGTAAAA TTGGTGGCAA CACCTATACT GGTGTTACTT 1150 
TCTATGTTTT GGAAAAGACT AAGACTGGTA TTGAATTAGT TAAAGCCAAG . 12 00 

ACTAAAGTTT TCGGTCATCA TATCCATGGT CAAAATGGCG CTGAAGAAAA 1250 

4 0 AGCTGCTACT TTTGCTGCTG GCATTGACTC TAAAGAATAC ATTGATTGGA 13 00 

TCATTGAAGC TAGAGATTAT GTACAAGTTG ATGAAATTGT CAAGACCAGC 13 50 

ACCACTCAAA TGATCAACCC AGTTCATTTT GAATCTGGTA AACTCGGTAA 14 00 

CTGGTTACAC GAACACAAGC AAAACAAATC ACTTGCTCCA CAATTGGGTT 1450 

ACTTGTTGGG TACTTGGGCT GGTATTGGAA ATGTTAAATC TTCTGCTTTC 1500 

4 5 ACCATGAACT CCAAAGATGA TGTTAAATTA GCTACAAGAA TTATGAACTA 1550 

CTCTTCAAAA TTGGGCATGA CTTGTTCTTC TACTGAATCC GGTGAACTCA 1600 

ATGTCGCTGA AAACGAAGAA GAATTTTTCA ATAACCTTGG TGCTGAAAAG 1650 

GATGAAGCTG GTGATTTCAC TTTTGATGAA TTTACCGATG CTATGGATGA 1700 

ATTGACTATC AATGTTCATG GTGCAGCTGC AAGCAAGAAG AACAATTTGT 1750 

50 TGTGGAATGC TTTGAAATCT CTTGGTTTCA GAGCCAAGTC TACTGATATT 18 00 

GTCAAGAGTA TTCCTCAACA TATTGCTGTT GATGATATTG TTGTCAGAGA 1850 

ATCTTTGATT GCCGGTTTAG TTGATGCTGC TGGTAATGTT GAAACCAAAT 1900 

CCAATGGTTC TATTGAAGCT GTTGTTAGAA CTTCTTTCAG ACATGTCGCT 1950 

AGAGGTCTTG TCAAGATTGC TCATTCTTTG GGTATTGAAT CATCTATTAA 2 0 00 

55 TATTAAAGAT ACTCACATTG ATGCTGCTGG TGTTAGACAA GAATTTGCTT 2050 

GTATTGTCAA TTTGACTGGT GCTCCACTTG CTGGTGTTCT TTCTAAATGT 2100 

GCACTTGCAA GAAACCAAAC TCCAGTTGTC AAATTTACCA GAGACCCAGT 2150 

TTTGTTCAAC TTTGATTTGA TCAAATCTGC AAAAGAAAAC TATTATGGTA 22 00 

TTACTTTGGC TGAAGAAACT GATCATCAAT TCCTTTTATC CAACATGGCC 2250 

60 TTGGTGCACA ACTGTGGTGA ACGTGGTAAT GAGATGGCTG AAGTTTTGAT 2300 
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GGAATTCCCA GAATTGTTTA CTGAAATTTC TGGTAGAAAA GAACCAATTA 2350 

TGAAACGTAC CACTTTGGTT GCCAATACTT CTAATATGCC AGTCGCTGCC 24 00 

AGAGAAGCTT CTATTTATAC TGGTATTACA TTGGCTGAAT ATTTCAGAGA 24 50 

TCAAGGTAAG AATGTTTCTA TGATTGCTGA TTCTTCTTCA CGTTGGGCTG 2 500 

5 AAGCTTTGAG AGAAATTTCT GGTAGATTGG GTGAAATGCC TGCTGATCAA 2 550 

GGTTTCCCAG CTTATTTGGG TGCTAAATTG GCTTCTTTCT ATGAGCGTGC 2 600 

CGGTAAAGCC ACTGCTTTGG GTTCACCAGA TAGAGTTGGT TCAGTTTCTA 2 650 

TTGTTGCTGC TGTTTCTCCA GCTGGTGGTG ATTTCTCTGA TCCAGTTACT 2700 

ACTTCTACTT TGGGTATTAC TCAAGTTTTC TGGGGGTTGG ATAAGAAATT 2 750 

10 GGCCCAAAGA AAACATTTCC CATCTATTAA CACCAGTGTT TCTTATTCTA 2 800 

AATACACCAA TGTTTTGAAC AAATACTATG ATTCCAACTA TCCAGAATTC 2 850 

CCACAATTGA GAGACAAAAT TAGAGAAATT TTATCTAATG CTGAAGAATT 2 900 

GGAACAAGTT GTTCAATTAG TTGGTAAATC TGCATTGTCT GATTCTGATA 2 950 

AGATTACTTT AGATGTTGCT ACCTTGATTA AAGAAGATTT CTTGCAACAA 3 000 

15 AATGGTTATT CTTCATATGA TGCATTCTGT CCAATTTGGA AGACTTTTGA 3 050 

TATGATGAGA GCATTTATTT CATATTATGA TGAAGCACAA AAAGCAATTG 310 0 

CCAATGGTGC TCAATGGTCT AAATTAGCTG AAAGTACTAG TGATGTTAAA 3150 

CATGCTGTTT CTTCAGCTAA ATTCTTTGAA CCATCAAGAG GTCAAAAAGA 32 00 

AGGTGAAAAA GAATTTGGAG ATTTATTAAC CACTATCTCC GAAAGATTTG 32 50 

2 0 CTGAAGCTTC AGAATAA 32 67 



2) INFORMATION FOR SEQ ID NO: 685 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1782 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Enterococcus hirae 

(B) STRAIN: ATCC 9790 

(C) ACCESSION NUMBER: D17462 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685 



TTGCAAATTG GAAAAATCAT AAAAGTCTCC GGTCCTCTCG TTATGGCAGA 50 

AAATATGTCA GAAGCAAGTA TTCAAGACAT GTGTTTAGTG GGAGATTTAG 100 

GAGTCATCGG CGAAATCATT GAGATGCGTC AAGATGTGGC GTCTATTCAA 150 

GTATATGAAG AAACTTCAGG AATTGGTCCC GGAGAACCTG TTCGTTCCAC 2 00 

4 5 TGGGGAAGCA CTATCTGTTG AGCTAGGACC AGGAATCATT TCACAAATGT 2 50 

TTGACGGGAT TCAAAGACCA CTGGATACAT TTATGGAAGT GACTCAAAGT 3 00 

AACTTCTTAG GACGTGGGGT CCAATTACCA GCTTTAGATC ATGAGAAACA 3 50 

ATGGTGGTTT GAAGCGACAA TCGAAGAAGG AACAGAAGTA AGTGCTGGAG 400 

ACATCATTGG GTACGTGGAT GAAACGAAGA TCATTCAGCA CAAAATCATG 4 50 

50 GTCCCTAATG GTATCAAAGG AACTGTACAA AAAATTGAAT CTGGATCATT 500 

TACGATCGAT GATCCGATTT GTGTGATCGA AACGGAACAA GGCTTAAAAG 550 

AGCTGACGAT GATGCAAAAA TGGCCAGTAC GTCGTGGTCG ACCAATCAAA 600 

CAAAAATTAA ATCCAGATGT ACCGATGATC ACCGGTCAAA GGGTCATTGA 650 

CACGTTTTTC CCAGTAACTA AAGGAGGAGC GGCAGCCGTT CCAGGTCCGT 700 

55 TTGGTGCAGG GAAGACAGTT GTGCAACACC AGATTGCTAA GTGGTCGGAC 750 

GTAGATCTAG TGGTTTACGT TGGTTGTGGG GAACGAGGAA ATGAAATGAC 8 00 

GGATGTCGTC AATGAATTTC CTGAACTGAT CGATCCAAAT ACAGGCGAGT 8 50 

CTTTGATGGA ACGAACTGTG TTGATCGCTA ATACATCGAA CATGCCAGTA 9 00 

GCTGCTCGAG AAGCTTCTAT TTATACGGGA ATCACGATTG CCGAGTACTT 950 

60 CCGTGACATG GGGTATGATG TAGCAATCAT GGCAGATTCC ACTTCTCGTT 10 00 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
GGGCAGAAGC ACTGCGTGAA 
GATGAAGGTT ATCCCGCTTA 
ACGTTCAGGA CGTGTCATTG 
TCACTGCCAT CAGTGCGGTT 
5 GTGACTCAAA ATACCTTACG 
TAGTCTTGCT CAAAAAAGAC 
ACTCATTATA TTCAACAGAA 
CAGGATTGGT CTGATATGGT 
AGAACAATTA AATGAAATCG 

10 ATAACGATCG CTTGACCCTT 
TTACAACAAA ATGCTTTTGA 
ACAATTCAAC ATGTTGAAAG 
AAGCCTTATC GTTGGGAGCG 
GCGGTCAGAG AACGCATTAG 
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ATGAGCGGAC GTTTAGAAGA 
TCTGGGCTCT CGTCTAGCTG 
CTCTAGGCTC TGACCAACGT 
TCTCCTTCTG GTGGAGATAT 
TGTGGTGAAG GTTTTCTGGG 
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GAAGTAGCAA AATCGATTCG 
TGATGTAGAT ACGTTTACTT 
TTATTTTGAC TTTTGGGAAA 
TACTTCAATG AAATCATGGA 
TCGGAGCAAG TATATTCCAG 
ATGAAGAAAT CAAAGAAACG 
ACCGATGATT AA 



PCT/CAOO/01150 

AATGCSTGGT 105CT 

AATACTATGA 1100 

GAGGGCAGTA 1150 

CTCTGAACCA 1200 

GATTAGATTC 1250 

ATCCAAAGTT 13 00 

AATCTTACAA 13 50 

TGCAAGAAGA 14 00 

TCGCTTTCTG 1450 

AGAAGACTAT 1500 

CAAGAGAAAA 1550 

GAAGCTCGAA 1600 

AGGTACAGTA 1650 

AAGAAGAGTT 1700 

ATCCAATTGA 1750 
1782 



20 2) INFORMATION FOR SEQ ID NO: 686 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1781 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia pneumoniae 

(B) STRAIN: CWL 02 9 

(C) ACCESSION NUMBER: Genome project 

3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68 6 

CAGGGACATG TTATAGAAGC TTATGGAAAC TTGTTACGTG TACGCTTTGA 50 

CGGATATGTT AGACAAGGTG AAGTTGCATA TGTCAACGTA GATAATACCT 100 

GGTTAAAAGC AGAAGTGATT GAAGTTGCTG ATCAAGAAGT CAAGGTTCAG 150 

4 0 GTATTTGAAG ATACACAAGG CGCGTGTCGA GGAGCTCTTG TTACGTTTTC 2 00 

AGGACATCTT TTAGAAGCCG AGTTAGGGCC TGGCTTGCTT CAGGGCATTT 2 50 

TCGATGGACT TCAAAATCGT CTTGAGGTGC TAGCTGAAGA TAGTTCTTTC 3 00 

TTGCAGAGAG GCAAGCATGT TAATGCTATT TCTGATCATA ATTTATGGAA 3 50 

TTATACTCCC GTAGCTTCTG TTGGGGATAC TTTAAGACGA GGAGATCTTC 400 

45 TAGGAACAGT ACCTGAAGGA CGATTTACTC ATAAGATTAT GGTTCCTTTT 450 

TCTTGCTTTC AAGAGGTTAC CCTGACTTGG GTAATTTCTG AAGGAACCTA 50 0 

TAATGCTCAT ACTGTGGTCG CAAAAGCTCG AGATGCTCAG GGTAAAGAAT 550 

GTGCCTTTAC TATGGTGCAA AGATGGCCGA TCAAACAAGC TTTTATTGAA 60 0 

GGAGAGAAGA TCCCTGCGCA TAAGATTATG GATGTGGGTT TGCGAATCTT 650 

50 AGATACGCAA ATTCCAGTAT TGAAGGGGGG AACTTTCTGT ACCCCAGGAC 700 

CTTTTGGTGC AGGGAAAACA GTCTTACAAC ACCATCTTTC TAAGTACGCT 750 

GCTGTAGATA TTGTGATTTT GTGTGCGTGC GGAGAGCGTG CTGGTGAAGT 800 

TGTTGAGGTA TTACAAGAGT TCCCTCATCT TATCGACCCC CATACCGGAA 850 

AGTCTTTAAT GCACAGAACA TGTATTATTT GTAACACATC ATCCATGCCT 90 0 

55 GTGGCTGCCC GAGAGTCTTC GATCTATTTA GGAGTGACGA TTGCAGAATA 950 

CTATCGCCAG ATGGGACTAG ATATTCTGCT TTTAGCTGAT TCTACATCCC 1000 

GATGGGCACA AGCCCTTAGA GAGATTTCGG GACGTCTTGA AGAAATCCCT 1050 

GGAGAGGAAG CATTTCCTGC ATACCTGTCT TCTAGAATAG CTGCTTTTTA 1100 

TGAGCGAGGA GGAGCTATCA CCACGAAAGA TGGTTCTGAA GGATCTTTAA 1150 

60 CTATATGTGG TGCGGTGTCT CCTGCAGGAG GAAACTTTGA AGAACCAGTC 1200 

389 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
ACTCAATCTA CATTAGCTGT 
ACGAGCTGAC GCACGTAGGT 
CAAAATATTT GAACCAGGTA 
TGGGGTGGTG CTGTGAAAAA 
5 AATCGGCAAG CGTATGGAAG 
ACATGGAAAT CTACTTAAAG 
CAGAACGCAT TCGATCCTGT 
AGAGTTATTT TCATTAATCA 
ATAGTCCTGA TGATGCAAGA 
10 AAGACATTAA ATGGCCTGAA 
AGAGGTCATA GTTAGACTGT 
GATATGCAAA CAATCTACAC 



AGTCGGAGCG TTCTGTGGTC 
ATCCTTCAAT AGACCCTTTG 
GGACAAATTT TAGAAGAGAA 
AGCAGCACAG TTTCTAGAGA 
TTGTCAGTGA AGAAGGGGTT 
GCAGAACTTT ATGATTTTTG 
GGACTGTTAT TGTCCTTTTG 
GTCGTATTTT TGATGCTAAA 
AGCTTTTTCC TTGAGCTGCA 
ATTTCTTTCA GAGGAATATC 
TGGAAAAAAC AATGGTACAA 
AAAAATAACT G 



PCT/CA00/01150 

TTTCAAAAGC 12 50 

ATTTCTTGGT 13 00 

GGTTTCAGGC 1350 

AAGGTTCAGA 14 00 

TCTATGGAAG 14 50 

TkATCTCCAG 1500 

AGAGACAGAT 1550 

TTTGTTTTTG 1600 

GAGCAAGATT 1650 

ATGAGAGTAA 1700 

ATGGCGTAAG 1750 
1781 



15 



2) INFORMATION FOR SEQ ID NO: 687 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1758 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Halohacterium salinarum 
(C) ACCESSION NUMBER: S56356 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687 

ATGAGTCAAG CTGAAGCAAT CACTGACACC GGCGAAATCG AGAGCGTGAG 50 

CGGTCCGGTC GTGACCGCCA CGGGCCTCGA CGCGCAGATG AACGACGTCG 100 

TCTACGTGGG CGACGAGGGT CTGATGGGCG AGGTCATCGA GATCGAAGGC 150 

3 5 GACGTAACCA CCATCCAGGT CTACGAGGAG ACGTCCGGCA TCGGGCCGGG 2 00 

CCAGCCCGTC GACAACACGG GCGAACCGCT CACCGTGGAC CTGGGCCCGG 2 50 

GGATGCTGGA CTCCATCTAC GATGGTGTGC AGCGTCCACT GGACGTCCTC 3 00 

GAAGACGAGA TGGGGGCGTT CCTCGATCGC GGTGTTGACG CACCCGGCAT 3 50 

CGACCTCGAC ACCGACTGGG AGTTCGAGCC CACCGTCGAG GCGGGCGACG 4 00 

4 0 AGGTCGCGGC CGGCGATGTC GTCGGCACCG TCGACGAAAC GGTCAGCATC 4 50 

GAACACAAGG TTCTGGTGCC CCCCCGCAGT GACGGCGGCG AAGTCGTCGC 50 0 

CGTCGAATCC GGCACGTTCA CCGTCGACGA CACGGTCGTC GAGTTGGACA 550 

CCGGCGAGGA GATCCAGATG CACCAGGAGT GGCCGGTCCG CCGCCAGCGC 60 0 

CCCACCGTCG ACAAGCAGAC GCCGACGGAG CCGCTGGTGT CCGGCCAGCG 6 50 

45 CATCCTCGAC GGCCTGTTCC CGATCGCGAA AGGCGGGACG GCCGCGATCC 70 0 

CGGGGCCGTT CGGGTCCGGG AAGACGGTCA CCCAGCAGTC CCTCGCGAAG 750 

TTCGCCGACG CGGACATCGT TGTCTACATC GGCTGTGGTG AGCGCGGCAA 80 0 

CGAGATGACG GAAGTCATCG AGGACTTCCC GGAGCTGCCC GACCCCCAGA 850 

CCGGGAACCC GCTGATGGCC CGCACCACGC TCATCGCGAA CACGTCGAAC 90 0 

50 ATGCCGGTTG CCGGGCGTGA GTCCTGCATT TACACGGGAA TCACCATCGC 950 

GGAGTACTAC CGCGACATGG GCTACGACGT GGCGCTGATG GCCGACTCCA 1000 

CCTCGCGGTG GGCGGAGGCC ATGCGGGAGA TCTCCTCGCG ACTCGAGGAG 1050 

ATGCCCGGCG AGGAGGGGTA TCCCGCGTAC CTGGCCGCCC GCCTCTCGGA 1100 

GTTCTACGAG CGCGCCGGCT ACTTCGAGAA CTTCAACGGG ACCGAGGGCT 1150 

55 CCATCTCGGT CATCGGTGCG GTGTCGCCGC CCGGCGGGGA CTTCTCCGAG 12 00 

CCGGTCACCC AGAACACGCT GCGCATCGTG AAGACGTTCT GGGCGCTTGA 12 50 

CTCGGACCTC GCCGAGCGCC GGCACTTCCC GGCGATCAAC TGGGACGAGT 13 00 

CCTACAGCCT CTACAAGGAC CAACTCGACC CGTGGTTCAC GGACAACGTC 13 50 

GTCGACGACT GGGCCGAGCA GCGCCAGTCG GCGGTCGACA TCCTCGACGA 14 00 

60 GGAATCCGAA CTCGAAGAGA TCGTGCAGCT CGTCGGGAAG GACGCGCTGC 14 50 

390 
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CCGAGGACCA GCAGCTCACG CTGGAAGTCG 

TGGCTCCAGC AGAACGCGCT CCACGACGTG 

GAAGACGTAC GCCATCCTCT CCGGCATCAA 

TTGAGGCGTT GGACGCCGGT GTGCCAGTCG 

GCCGCGCCGC GCCTGAATCG TCTCGGCACG 

GGCGGAGGTC GCGGAGATCA AACAGCAGAT 
TCTACTGA 



PCT/CA00/01150 

CGCGGTACAT CCGCGAGGCG L500 

GATCGCTACT GCCCGCCCGA 1550 

GACGCTTCAC GAGGAGTCCT 16 00 

AGGAGATCAC GTCGATCGAC 1650 

ACGCCCGACG ACGAGCACGA 17 00 

TACCGAGCAG CTTCGGGAGC 1750 

1758 



2) INFORMATION FOR SEQ ID NO: 688 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human 

(C) ACCESSION NUMBER: L09234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688 

GAATTCCGGC AGCTGACTAG TCTTGTGATT GGGGTCCTGG GCTGATAAAA 50 

TCATTCCAAA TGACGAGCAC ATTGATAAAG ACGTCCGATG AGGACCGGGA 100 

GTCCAAATTC GGCTTTGTTT TTGCCGTATC TGGACCTGTG GTGACAGCTG 150 

AACGAATGGC CGGTTCTGCT ATGTACGAAC TGGTGCGTGT CGGTTATTAT 2 00 

GAACTGGTCG GAGAGATCAT CCGGTTGGAG GGTGACATGG CAACAATCCA 2 50 

AGTATACGAA GACACCTCAG GTGTGACAGT AGGCGATCCC GTGCTGCGCA 3 00 

CAGGCAAGCC GCTGTCCGTG GAACTGGGAC CCGGAATCAT GGGCAGCATC 350 

TTCGACGGTA TCCAGCGACC GCTGAAGGAT ATCAATGAAC TGTCAAATAG 4 00 

TATCTACATC CCGAAAGGTG TCAATGTGCC TGCCCTGAGT CGCACTGCAC 450 

AGTGGGACTT CAGTCCCGTC AGTGTCAAGG TTGGAAGCCA CATTACTGGT 500 

GGTGACCTGT ACGGTTTGGT CCACGAAAAT ACTCTGGTGA AACACAAGTT 550 

GCTGCTGCCG CCCCGTGCCA AGGGAACTGT CACGTACATT GCAGAACCTG 600 

GAAACTACAC AGTTGATGAT GTTGTCCTGG AGACAGAATT TGACGGCGAG 650 

CGATCAAAGT TCACCATGCT GCAAGTGTGG CCTGTACGTC AGCCCAGGCC 700 

TGTTACAGAA AAGTTGCCAG CTAACTACCC CCTCCTTACT GGCCAGCGTG 750 

TGCTCGACTC CCTATTCCCG TGTGTCCAGG GTGGAACAAC AGCTATTCCT 800 

GGGGCCTTCG GATGTGGCAA GACTGTAATA TCACAGTCTT TGTCAAAATA 850 

CTCAAACTCC GATGTAATTA TCTATGTAGG TTGTGGTGAG CGAGGTAATG 900 

AAATGTCAGA AGTACTCAGG GATTTCCCGC AGTTGTCGTT GGAGATTGAT 950 

GGTGTGACTG AATCAATCAT GAAGAGAACA GCCCTGGTCG CAAACACATC 1000 

AAACATGCCT GTGGCTGCTC GAGAAGCATC TATCTACACA GGTATTACAC 1050 

TGTCAGAATA CTTCAGGGAC ATGGGTTACA ATGTATCCAT GATGGCTGAC 1100 

TCAACTTCAC GATGGGCCGA AGCTCTTCGA GAAATCTCAG GTCGATTGGC 1150 

TGAAATGCCT GCCGACAGCG GTTATCCCGC CTACCTAGGT GCACGACTTG 12 00 

CCAGTTTCTA CGAGCGTGCC GGCCGTGTGA AGTGCTTGGG TAAC CCAGAC 12 50 

AGGGAGGGCT CCGTGAGTAT AGTGGGCGCC GTGTCGCCGC CCGGTGGAGA 13 00 

CTTCTCAGAT CCCGTGACGA CGGCCACACT AGGTATCGTC CAGGTGTTCT 13 50 

GGGGTCTCGA CAAGAAACTT GCCCAGCGAA AGCACTTCCC ATCCATCAAC 14 00 

TGGCTCATCT CGTACAGTAA ATACATGCGT GCTCTGGATG ACTTCTACGA 14 50 

CAAGAATTTC CCAGAGTTTG TCCCACTGCG TACAAAGGTG AAGGAGATTT 1500 

TGCAGGAGGA AGAAGACCTG TCTGAAATTG TGCAGTTGGT CGGTAAAGCT 1550 

TCATTGGCAG AAACTGACAA GATCACACTT GAGGTTGCCA AACTATTAAA 1600 

GGATGATTTC CTGCAACAGA ACAGCTATTC ACCATATGAC CGTTTCTGCC 1650 

CATTCTACAA GACAGTAGGA ATGCTGAAAA ATATGATTGC TTTCTACGAT 1700 

391 



SUBSTITUTE SHEET (RULE 26) 
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ATGTCTCGGC ATGCAGTTGA ATCTACTGCT 
TTGGAATGTT ATTAGAGATT CTATGGGCAA 
CCATGAAATT CAAGGATCCA GTCAAGGATG 
GACTTTGAGC AGCTTCATGA AGACATTCAG 
5 GGATTAAAGT GGTAGCTGCC AGTGGTTCTC 
GGCAAGCTCT GTAGGGTTGC CGAGTGGCAT 
TTCCTTTGCC ACATAAAGAC TAAAGCAGGT 
CTGGTTCCAT TGGTGCTAAG ATTATGTTGT 
TTCCAACAGA GGAATTTACT TCCAGTTTTC 

10 TAAGTGTCGG TACAGAGGCA ATAATCTGAT 
AAGCAGGGAG AATTTGTAAT TATTACAAAT 
CAGCCTTGTA AATTCATTTG TCCCAGGACT 
ATTGCCGTCT GTATGTATGT ACACACCGTA 
CAGAAGGTGA ATTACACCAC TTACTCATTG 

15 AACTGCCATC CATTGTCCTA TTTATTCACA 
TCCAGTGTTG CAAATTGTGT TTAGAAAATT 
ACCTCACATT GTAACTCAGT ATTTACACAC 
TGTAGAAAAA ATAATTGTTG TATATTGAAA 
TTTAAAATGG TGAATGTATT TTATATTTCT 

2 0 TGCATTTTGC TTAATGGAGA TGTATGTAAA 
ACAAATTATG TATATGTGAA ATGGAGATGG 
TGCCAGTATT AATTTAAACA ACTGTAGTTG 
TTTTTTTTCC TATAAAATTG GTGGATGTAT 
ATGTGTAAAA TAGTGATCCC AGTAACTGTA 

2 5 TGACTGTATA GTTGTACAGG TGTTGTTACT 
AAGGTGAAAG TCTATTATTG TATTGTAATG 
AAAATCCTAC TTCTGTGTAT AAATGTTACC 
GTGTAGAAAT ACAGTCAACT TCATGTTCAT 
ATAAATTATG CCCGGAAT 

30 



PCT/CAOO/01150 

CAGAGCGAGA ACAA&ATCAC 175^0 

TATTCTGTAT CAGCTTTCCT 18 0 0 

GAGAAGCGAA GATCAAGGCA 18 50 

CAAGCCTTCA GGAACCTGGA 1900 

TCGGTGCAGT TGTCACATTT 1950 

CGGTGCTAGA CACCTGAGCA 2 0 00 

GGAATTTCAG TTGTAAAAAG 2 050 

GCCCTTTTCT GCTTCTCACA 2100 

TTCCATTTTC CTCCTCATTT 2150 

AACTCTGTAC CGTCACTTAC 22 00 

CCCATTATCT CTGTGCACCA 22 50 

CCCTCTTGTG TGTACGTGAG 23 00 

CTGCAGTATT TGAAGTCAGT 23 50 

TGTCACGTAG CAAGTGTGCA 24 00 

TAACTAGTTT TCTTTGCATT 2450 

ATGCCATCGA GACTGGTCGA 2 500 

ACGTTTACTT GCTACAGAAA 2550 

GTACAAGTGA CAAAGTTGCA 26 00 

TTTGTAGACA CAAGAGTTAA 2650 

CCTAAAATAG CAGTTTGTGC 2700 

TTTCTAATTT GCTGATTGAT 27 50 

TGGGATGTAG TGGGAAGATT 2800 

GTGTCGGAGA TTTTGATTGT 2850 

AAGCTTTAGA ATACAGTTAC 2 900 

TTTAAGAATT TATTGACACA 2 950 

TTTAAAGCAT TTAAGGTTTA 3 0 00 

ATTCCTCATA TAACATAACT 3 050 

TAGCATTTCA CTGTTGTCAC 3100 

3118 



2) INFORMATION FOR SEQ ID NO: 689 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plasmodium falciparum 
4 5 (B) STRAIN: 3D7 

(C) ACCESSION NUMBER: L082 00 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 68 9 

50 ATGACAAAAG TTGCTGTTGA AAAAGAGGAA CCAGGAGTTG TTTATAAGGT 50 

GGCTGGTTCA TTAGTTATTG CTGAGAATAT GAGTGGAACT CGTATGTACG 100 

AGTTAGCTAA AGTAGGATGG AATAAATTGG TTGGAGAAAT TATTAGATTA 150 

GAAGGGAATT ATGCATATAT ACAAGTTTAT GAAGATACTT CAGGTTTATC 2 00 

TGTAGGAGAC CCTGTTATAA AAACAGGAAA TGCTTTATCA GTCGAATTGG 2 50 

55 GTCCTGGTAT TTTAGATAAT ATTTATGATG GTATTCAAAG ACCATTAGAA 3 00 

AGAATAGCAA ATGTGTGTGG TGATGTATAT ATATATAAAG GTATTGATAT 350 

GACATCTTTA GATCATGATA AACAATGGCA ATTTTATGCT GATAAGAAAT 4 00 

TAAAATTAAA TGATATTGTT ACTGGTGGAG ATATCTTTGG ATTTGTTGAT 4 50 

GAAAATAAAT TATTTAAAGA ACACAAAATT ATGGCTCCAC CTAATGCTAA 500 

60 AGGGAGGCTT ACATATATTG CTCCAGATGG ATCATATACT TTAAAAGATA 550 

392 
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AAATATTTGA ATTAGAATAT 
CATTTATGGC CTGTTCGTGA 
GGATACTTTA TTATTAACAG 
CAGTTCAAGG AGGTACTTGT 
5 ACTTGTGTTT CTCAGGCCTT 
ATATGTAGGA TGTGGTGAAA 
ACTTTCCTGA ATTAACTACT 
CAAAGAACGT GTTTAGTTGC 
AGAAGCTAGT ATTTATACAG 

10 TGGGTTATAA TGCTACCATG 
GCCTTAAGAG AAATTTCAGG 
TTATCCAGCT TATTTAGGTG 
GAAAAGTCAA ATGTATTGGT 
GTGGGTGCTG TGTCTCCACC 

15 AGCAACCATG TCTATTGTTC 
CTCAAAGAAA ACATTTCCCT 
TATGTCAGAC AATTAGAACA 
ATCTTTAAGA CAAAAAATTA 
ATGATATTGT TCAACTAGTA 

2 0 GTTGTTATGG AAGTAGCCAA 
TGCATTTAGC GATTATGATT 
TGATGAGAAT TATTTGCCAC 
GAATATGACT CAAGAGAAAG 
ATTAAGACCA ACTATAAATA 

25 AAAATTCAGA TGAATATTTC 
ATAACAGTAG GTTTAAGAAA 



CAAGGAAAAA AATATACATA 
TCCTAGACCT GTTTTAGAAA 
GGCAAAGAGT TTTAGATTCG 
GCTATTCCTG GTGCATTTGG 
ATCAAAATAT TCTAATAGTG 
GAGGTAATGA AATGGCTGAA 
AAAGTAGATA ATGAAGATGT 
TAATACTTCT AACATGCCTG 
GTATTACTTT ATGTGAATAT 
ATGGCTGATA GTACAAGTAG 
ACGTTTAGCT GAAATGCCTG 
CTAGATTAGC TTCCTTTTAT 
TCTCCATCTC GTATAGGATC 
AGGTGGTGAT TTCTCTGACC 
AAGCATTTTG GGGGTTAGAT 
TCTGTTAATT GGTCTACATC 
ATACTTTGAT AATTTTGATC 
GTGATATTTT ACAACAAGAA 
GGAAAGGATT CATTATCAGA 
AATTATTAGA GAAGATTTTC 
ATATGTGCCC ATTACAAAAA 
TTTTATGCTC AATGCTTAAG 
AAAAATTGGT TGGGGATCTA 
AAATTACACA TATGAAATTT 
AAAAAGTATT TTAAGGCACT 
CTTGATGGAA AAATGA 



PCT/CA00/01150 

T&3TTTATCT 600 

AGGTAACAGG 650 

TTATTTCCAA 700 

TTGTGGAAAA 750 

AAGTTATTAT 800 

ATTTTATCCG 850 

AGGTATTATG 900 

TCGCTGCAAG 950 

TTCCGTGATA 1000 

ATGGGCAGAA 1050 

CAGATAGTGG 1100 

GAACGTGCAG 1150 

CATTACAATT 120 0 

CTGTAACTAC 1250 

AAAAAACTAG 13 0 0 

CTTTTCAAAG 1350 

AAGATTTCTT 14 00 

AGTGACTTGA 1450 

AGAC CAAAAA 1500 

TTCAACAAAA 1550 

ACAGTTGGTA 1600 

AACATTACAA 1650 

TATATAATAC 1700 

GAAAACCCAA 1750 

TGAAGAAGAA 1800 
1836 



30 2) INFORMATION FOR SEQ ID NO: 690 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3216 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: X2180-1A 

(C) ACCESSION NUMBER: J05409 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 690 

ATGGCTGGTG CAATTGAAAA CGCTCGTAAG GAAATAAAAA GAATCTCATT 50 

AGAAGACCAT GCTGAATCTG AATATGGTGC CATCTATTCT GTCTCTGGTC 100 

CGGTCGTCAT TGCTGAAAAT ATGATTGGTT GTGCCATGTA CGAATTGGTC 150 

50 AAGGTCGGTC ACGATAACCT GGTGGGTGAA GTCATTAGAA TTGACGGTGA 200 

CAAGGCCACC ATCCAAGTTT ACGAAGAAAC TGCAGGCCTT ACGGTCGGTG 250 

ACCCTGTTTT GAGAACAGGT AAGCCTCTGT CGGTAGAATT GGGTCCTGGT 300 

CTGATGGAAA CCATTTACGA TGGTATTCAA AGACCTTTGA AAGCCATTAA 350 

GGAAGAATCG CAATCGATTT ATATCCCAAG AGGTATTGAC ACTCCAGCTT 4 00 

55 TGGATAGGAC TATCAAGTGG CAATTTACTC CGGGAAAGTT TCAAGTCGGC 450 

GATCATATTT CCGGTGGTGA TATTTACGGT TCCGTTTTTG AGAATTCGCT 500 

AATTTCAAGC CATAAGATTC TTTTGCCACC AAGATCAAGA GGTACAATCA 550 

CTTGGATTGC TCCAGCTGGT GAGTACACTT TGGATGAGAA GATTTTGGAA 600 

GTTGAATTTG ATGGCAAGAA GTCTGATTTC ACTCTTTACC ATACTTGGCC 650 

60 TGTTCGTGTT CCAAGACCAG TTACTGAAAA GTTATCTGCT GACTATCCTT 700 

393 
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TGTTAACAGG TCAAAGAGTT TTGGATGCTT TGTTTCCTTG TGTTCAAGGT 1B0 

GGTACGACAT GTATTCCAGG TGCTTTTGGT TGTGGTAAGA CCGTTATCTC 800 

TCAATCTTTG TCCAAGTACT CCAATTCTGA CGCCATTATC TATGTCGGGT 850 

GCTTTGCCAA GGGTACCAAT GTTTTAATGG CGGATGGGTC TATTGAATGT 900 

5 ATTGAAAACA TTGAGGTTGG TAATAAGGTC ATGGGTAAAG ATGGCAGACC 950 

TCGTGAGGTA ATTAAATTGC CCAGAGGAAG AGAAACTATG TACAGCGTCG 1000 

TGCAGAAAAG TCAGCACAGA GCCCACAAAA GTGACTCAAG TCGTGAAGTG 1050 

CCAGAATTAC TCAAGTTTAC GTGTAATGCG ACCCATGAGT TGGTTGTTAG 1100 

AACACCTCGT AGTGTCCGCC GTTTGTCTCG TACCATTAAG GGTGTCGAAT 1150 

10 ATTTTGAAGT TATTACTTTT GAGATGGGCC AAAAGAAAGC CCCCGACGGT 12 00 

AGAATTGTTG AGCTTGTCAA GGAAGTTTCA AAGAGCTACC CAATATCTGA 1250 

GGGGCCTGAG AGAGCCAACG AATTAGTAGA ATCCTATAGA AAGGCTTCAA 13 00 

ATAAAGCTTA TTTTGAGTGG ACTATTGAGG CCAGAGATCT TTCTCTGTTG 1350 

GGTTCCCATG TTCGTAAAGC TACCTACCAG ACTTACGCTC CAATTCTTTA 14 00 

15 TGAGAATGAC CACTTTTTCG ACTACATGCA AAAAAGTAAG TTTCATCTCA 1450 

CCATTGAAGG TCCAAAAGTA CTTGCTTATT TACTTGGTTT ATGGATTGGT 1500 

GATGGATTGT CTGACAGGGC AACTTTTTCG GTTGATTCCA GAGATACTTC 1550 

TTTGATGGAA CGTGTTACTG AATATGCTGA AAAGTTGAAT TTGTGCGCCG 1600 

AGTATAAGGA CAGAAAAGAA CCACAAGTTG CCAAAACTGT TAATTTGTAC 1650 

2 0 TCTAAAGTTG TCAGAGGTAA TGGTATTCGC AATAATCTTA ATACTGAGAA 1700 

TCCATTATGG GACGCTATTG TTGGCTTAGG ATTCTTGAAG GACGGTGTCA 1750 

AAAATATTCC TTCTTTCTTG TCTACGGACA ATATCGGTAC TCGTGAAACA 18 0 0 

TTTCTTGCTG GTCTAATTGA TTCTGATGGC TATGTTACTG ATGAGCATGG 18 50 

TATTAAAGCA ACAATAAAGA CAATTCATAC TTCTGTCAGA GATGGTTTGG 190 0 

25 TTTCCCTTGC TCGTTCTTTA GGCTTAGTAG TCTCGGTTAA CGCAGAACCT 1950 

GCTAAGGTTG ACATGAATGG CACCAAACAT AAAATTAGTT ATGCTATTTA 2 000 

TATGTCTGGT GGAGATGTTT TGCTTAACGT TCTTTCGAAG TGTGCCGGCT 2 050 

CTAAAAAATT CAGGCCTGCT CCCGCCGCTG CTTTTGCACG TGAGTGCCGC 210 0 

GGATTTTATT TCGAGTTACA AGAATTGAAG GAAGACGATT ATTATGGGAT 2150 

30 TACTTTATCT GATGATTCTG ATCATCAGTT TTTGCTTGCC AACCAGGTTG 22 00 

TCGTCCATAA TTGCGGAGAA AGAGGTAATG AAATGGCAGA AGTCTTGATG 22 50 

GAATTCCCAG AGTTATATAC TGAAATGAGC GGTACTAAAG AACCAATTAT 23 00 

GAAGCGTACT ACTTTGGTCG CTAATACATC TAACATGCCG GTTGCAGCCA 23 50 

GAGAAGCTTC TATTTACACT GGTATCACTC TTGCAGAATA CTTCAGAGAT 24 00 

35 CAAGGTAAAA ATGTTTCTAT GATTGCAGAC TCTTCTTCAA GATGGGCTGA 24 50 

AGCTTTGAGA GAAATTTCTG GTCGTTTGGG TGAGATGCCT GCTGATCAAG 2 5 00 

GTTTCCCAGC TTATTTGGGT GCTAAGTTGG CCTCCTTTTA CGAAAGAGCC 2 550 

GGTAAAGCTG TTGCTTTAGG TTCCCCAGAT CGTACTGGTT CCGTTTCCAT 2 6 00 

CGTTGCTGCC GTTTCGCCAG CCGGTGGTGA TTTCTCAGAT CCTGTTACTA 2 650 

40 CTGCTACATT GGGTATCACT CAAGTCTTTT GGGGTTTAGA CAAGAAATTG 2 7 00 

GCTCAAAGAA AGCATTTCCC ATCTATCAAC ACATCTGTTT CTTACTCCAA 2 750 

ATACACTAAT GTCTTGAACA AGTTTTATGA TTCCAATTAC CCTGAATTTC 2 8 00 

CTGTTTTAAG AGATCGTATG AAGGAAATTC TATCAAACGC TGAAGAATTA 2 850 

GAACAAGTTG TTCAATTAGT TGGTAAATCG GCCTTGTCTG ATAGTGATAA 2 9 00 

45 GATTACTTTG GATGTTGCCA CTTTAATCAA GGAAGATTTC TTGCAACAAA 2 950 

ATGGTTACTC CACTTATGAT GCTTTCTGTC CAATTTGGAA GACATTTGAT 3 000 

ATGATGAGAG CCTTCATCTC GTATCATGAC GAAGCTCAAA AAGCTGTTGC 3 050 

TAATGGTGCC AACTGGTCAA AACTAGCTGA CTCTACTGGT GACGTTAAGC 3100 

ATGC CGTTTC TTCATCTAAA TTTTTTGAAC CAAGCAGGGG TGAAAAGGAA 3150 

50 GTCCATGGCG AATTCGAAAA ATTGTTGAGC ACTATGCAAG AAAGATTTGC 32 00 

TGAATCTACC GATTAA 3216 



55 2) INFORMATION FOR SEQ ID NO: 691 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1860 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS : Double 

394 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Schizosaccharomyces pombe 

(B) STRAIN: 972 h- 

(C) ACCESSION NUMBER: S47814 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 91 

ATGGCGGGAG GAATTGAACT GGCCAAGAAG GCTATCAGGA GCCTCAAAAA 50 

TTACGACGAG CATGAAAACC GATATGGATC TATTTTCAGC GTTTCTGGTC 100 

CTGTCGTTGT TGCAGCCAAT ATGCTTGGAT GTTCGATGTA CGAACTCGTT 150 

15 CGCGTTGGTC ATGAAGAACT AGTTGGTGAA GTAATTCGTA TCCATCAAGA 2 00 

TAAATGTACT ATTCAAGTAT ACGAAGAGAC GTCCGGTCTC ACTGTTGGTG 2 50 

ATCCTGTCCA ACGCACTGGA AAGCCATTAT CTGTTGAATT AGGTCCTGGT 300 

TTAGCTGAGA CTATTTATGA TGGTATCCAA CGTCCGTTAA AGCAAATTTT 3 50 

CGACAAATCT CAAAGTATTT ATATTCCTAG AGGTATTAAT ACAGAATCAC 4 00 

2 0 TTAATCGTGA GCATAAGTGG GATTTCACAC CAAATAAGGA TTTACGCATT 450 

GGCGATCATG TATCCGGTGG TGATGTTTTC GGTTCTGTAT TTGAAAACTC 500 

TCTTTTCAAT GATCATAAAA TTATGTTACC CCCTAGAGCC CGTGGTACCG 550 

TCACATATAT TGCTGAAGCT GGATCATACC ATGTTGATGA AAAACTTCTT 600 

GAAGTCGAGT TTAATGGCAA GAAACATTCT TTTAGTATGT TGCATACTTG 650 

25 GCCTGTCCGT GCTGCTCGTC CAGTTGCGGA CAACTTAACT GCTAATCAAC 700 

CTTTATTGAC TGGTCAACGT GTTTTGGATG CGTTATACCC CTGTGTTCAA 750 

GGTGGCACTA CTGCTATCCC CGGTGCCTTT GGTTGTGGTA AAACAGTTAT 800 

TTCACAATCT CTTTCTAAGT ACTCTAATTC TGATTTGATT GTTTACGTCG 850 

GTTGTGGTGA ACGTGGAAAC GAAATGGCAG AAGTGTTAAT GGATTTCCCA 90 0 

30 GAACTAACAA TTGATATTAA TGGTAAACCA GAGCCCATTA TGAAGCGTAC 950 

TACATTGGTA GCCAACACTT CTAACATGCC TGTCGCTGCT CGTGAAGCTT 1000 

CCATTTATAC CGGTATTACA CTTGCTGAAT ATTATCGTGA TCAAGGTAAG 1050 

AACGTTTCAA TGATGGCTGA TTCTACATCT CGTTGGGCTG AAGCTTTGCG 1100 

TGAAATTTCT GGTCGTTTGG CTGAGATGCC TGCCGATTCT GGTTATCCCG 1150 

35 CTTATTTGGG TGCCAAATTG GCTTCTTTTT ACGAACGTGC TGGTCGTGCT 12 00 

CGTTGCTTGG GAAGTCCTGA CCGTGAAGGA ACAGTTTCAA TTGTTGGAGC 1250 

TGTTTCTCCT CCGGGTGGTG ATTTTTCTGA TCCTGTTACT AGTGCAACCT 13 0 0 

TGGGAATTGT TCAAGTCTTC TGGGGTTTGG ACAAGAAATT GGCCCAACGT 1350 

AAACACTTTC CCTCAATCAA CACCTCTCTT TCCTATTCTA AATACATCAA 14 00 

40 TGCTTTGCAA CCTTGGTATG AGGAAAGAGT TCCAGGCTTT AATACTCTTC 1450 

GTGATCAAAT CAAACAGATC ATTCAACAAG AAGATTCCAT GTTGGAAATT 1500 

ATTCAGTTGG TTGGTAAGTC GGCTCTTTCT GAAACGGATA AAGTTACTTT 1550 

GGACATAGCC GGTATTATTA AGAATGACTT CTTACAACAA AACGGTTATT 1600 

CTGATTACGA TCGCTGTTGC CCTCTTTACA AGACTTATCA TATGATGCGA 1650 

45 AACATGATTG CTTACTACAC AAAGGCTAAA AGTGCCGTTG AAACTGGTAG 1700 

CGTTCCTTGG TCAAAGATTA AAGAAAGTAC TTCAGATATC TTTTATGAGT 1750 

TAACCTCGAT GAAATTCGAA AACCCTAATG AAGGCGAGAA GGAAATAGTC 1800 

GAACACTATG AAACTCTGCA CAAGAAGATT GAGGACAAGT TTCACACTCT 1850 

GACTGAGTAA 1860 

50 



2) INFORMATION FOR SEQ ID NO: 692 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 183 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

60 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma congolense 
5 (B) STRAIN: IL3000 

(C) ACCESSION NUMBER: Z25814 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6 92 

10 ATGACGAGCG ATAAAAACCC TTACAAAACA GAGCAGCGCA TGGGGGCCGT 50 

GAAGGCCGTC TCCGGGCCAG TTGTCATTGC TGAAAACATG GGCGGTAGCG 10 0 

CTATGTATGA GCTTGTGCAG GTAGGTTCCT TCCGGTTAGT GGGCGAGATC 15 0 

ATTCGTCTAG AGGGCGATAC CGCCACTATT CAGGTCTATG AGGAAACAGG 200 

TGGCCTCACT GTCGGAGACC CGGTGTACTG TACGGGTAAG .CCTCTTTCGC 250 

15 TTGAGCTTGG ACCTGGAATC ATGTCTGAAA TATTTGACGG TATCCAGCGG 3 00 

CCTCTTGACA CCATCTACCG CATGGTGGAA AACGTGTTTA TCCCCAGGGG 350 

CGTTCAGGTG AAGTCACTCA ATGACCAGAA ACAGTGGGAC TTTAAGCCAT 4 00 

GCCTGAAGGT TGGAGATCTT GTGTCTGGTG GTGATATCAT TGGCTCAGTG 450 

GTGGAGAACT CTCTCATGTA CAATCACAGC ATTATGATTC CGCCCAATGT 500 

20 GCGGGGCCGT GTTACTTCCA TTGTTCCTTC AGGAAATTAC ACCCTCCAAG 550 

ATGACATTAT TGAATTGGAA TATAATGGGA CAGTGAAATC ACTAAAACTT 600 

ATGCATCGCT GGCCAGTACG GACCCCGCGT CCTGTGGCGT CAAAAGAATC 650 

CGGCAATCAT CCGCTTCTCA CCGGACAGCG TGTGCTCGAT GCTCTCTTTC 7 00 

CATCCGTCCA GGGTGGAACA TGCGCCATCC CTGGCGCGTT TGGATGCGGA 750 

25 AAGACGGTTA TCAGTCAGGC TCTTTCGAAG TTCTCCAACA GCGACGCTGT 800 

TATCTATGTC GGCTGCGGCG AGCGTGGGAA TGAGATGGCA GAGGTGCTCA 850 

TGGACTTCCC CACACTCACC ACCGTTATTG ATGGTCGTGA GGAGTCCATC 900 

ATGAAGCGTA CCTGCCTGGT GGCAAACACC TCAAATATGC CTGTCGCTGC 950 

TCGTGAGGCG TCTATTTACA CTGGCATCAC TTTAGCTGAG TATTATCGTG 1000 

3 0 ATATGGGCAA GCACATTGCT ATGATGGCCG ACTCTACCTC TCGATGGGCT 1050 

GAGGCTCTCC GTGAGATCTC TGGGCGTCTC GCTGAAATGC CCGCTGATGG 1100 

TGGTTACCCT GCGTACCTCA GTGCGCGTCT TGCTTCCTTC TACGAGCGTG 1150 

CGGGGCGCGT GACATGCATC GGTGGGCCAA AACGCGAGGG CTCAGTAACC 12 00 

ATCGTTGGTG CCGTTTCTCC TCCTGGAGGT GACTTTTCTG ACCCAGTGAC 12 50 

35 GTCCGCTACG CTTGGTATTG TGCAAGTCTT TTGGGGTCTT GAGAAGCGTC 13 00 

TTGCGCAACG TAAACACTTT CCTTCTGTTA ATTGGCTCAT TTCCTATTCA 13 50 

AAATACCTTA ATGCTTTGGA GCCCTTCTTC AACACGCTTG ACCCTGACTA 14 00 

CATGCGCCTG CGGTCAGTTG CTGCGGAGAT CCTTCAGCGT GAGGAAGAGT 1450 

TGCAAGAAAT TGTTCAACTT GTCGGTAAGG ACTCACTTTC GGAGTCTGAC 1500 

40 AAAATTATTC TAGAAACGGC TAAGGTTATT CGTGAAGAGT TTCTCCAGCA 1550 

GAATGCCTTT ACGCCGTACG ACAAGTATTG CCCGCCGTAC AAGACCTGCT 1600 

GGATGCTACG TAACATTGTC GCGTTCTACG AGGAGAGCCA GCGCGTTGTA 1650 

GCTGAGTCCG CTGGGGAACT TAAGATTACG TGGAACTACA TTCGTGAAAT 17 00 

GATTCCTCAT ATTTACACGG GTTTAACTGA GATGAAGTTC CGTGATCCTC 17 50 

45 AGGAGGGTGA GGAGGCCAAC GTAGAATTCT ACAGAAAACA AAATGAGGAA 18 00 

ATTGTCAGCG CATTCGCCTC GCTGCTGCAA TAA 183 3 



50 2) INFORMATION FOR SEQ ID NO: 693 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1758 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Thermus thermophilus 

(B) STRAIN: HB8 

(C) ACCESSION NUMBER: D63799 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693 

ATGATCCAAG GGGTGATCCA GAAGATCGCG GGCCCGGCGG TGATCGCCAA 50 

GGGCATGCTC GGGGCCCGCA TGTACGACAT CTGCAAGGTG GGCGAAGAGG 100 

GCCTCGTGGG CGAGATCATC CGCCTGGACG GGGACACGGC CTTCGTCCAG 150 

10 GTCTACGAGG ACACCTCGGG CCTAAAGGTG GGGGAGCCCG TGGTCTCCAC 2 00 

GGGCCTTCCC TTGGCGGTGG AGCTCGGCCC CGGGATGCTG AACGGCATCT 2 50 

ACGACGGCAT CCAGCGCCCC CTGGAGCGCA TCCGGGAGAA GACGGGGATC 3 00 

TACATCACCC GGGGCGTGGT GGTCCACGCC CTGGACCGGG AGAAGAAGTG 3 50 

GGCCTGGACG CCCATGGTCA AGCCCGGGGA CGAGGTGCGG GGGGGTATGG 4 00 

15 TCCTGGGCAC GGTGCCCGAG TTCGGCTTCA CCCACAAGAT CCTGGTACCC 4 50 

CCGGACGTGC GGGGCCGGGT CAAGGAGGTG AAGCCCGCCG GGGAGTACAC 500 

CGTGGAGGAG CCGGTGGTGG TCCTCGAGGA CGGCACCGAG CTCAAGATGT 550 

ACCACACCTG GCCCGTTCGC CGGGCGAGGC CCGTGCAAAG GAAGCTTGAC 600 

CCCAACACCC CCTTCCTCAC GGGGATGCGC ATCCTGGACG TCCTCTTCCC 650 

20 CGTGGCCATG GGGGGCACCG CCGCCATCCC TGGGCCCTTC GGCAGCGGCA 70 0 

AGACCGTGAC CCAGCAGTCC CTGGCCAAGT GGTCCAACGC CGACGTGGTG 750 

GTCTACGTGG GCTGCGGGGA GCGGGGGAAC GAGATGACCG ACGTGCTCGT 8 00 

GGAGTTCCCC GAGCTCACCG ACCCCAAGAC GGGTGGGCCC TTGATGCACC 850 

GCACCGTCCT CATCGCCAAC ACCTCCAACA TGCCCGTGGC CGCCCGCGAG 900 

25 GCCAGCATCT ACGTGGGCGT GACCATCGCC GAGTACTTCC GCGACCAGGG 950 

CTTCTCCGTG GCCCTCATGG CCGACTCCAC GAGCCGCTGG GCCGAGGCTT 100 0 

TGCGCGAGAT CTCTAGCCGC CTCGAGGAGA TGCCCGCCGA GGAGGGCTAC 1050 

CCGCCCTACC TCGCCGCCAG GCTCGCCGCC TTCTACGAGC GGGCGGGCAA 110 0 

GGTCATCACC CTGGGCGGCG AGGAGGGGGC GGTGACCATC GTGGGGGCCG 1150 

3 0 TCTCCCCGCC GGGCGGCGAC ATGTCCGAGC CCGTGACCCA GTCCACCTTG 12 0 0 

AGGATCGTGG GGGCCTTCTG GCGGCTTGAC GCCTCCCTGG CCTTCCGCCG 12 50 

CCACTTCCCC GCCATCAACT GGAACGGCTC CTACAGCCTC TTCACCTCCG 13 0 0 

CCCTTGACCC CTGGTACCGG GAGAACGTGG CCGAGGACTA CCCCGAGCTC 13 50 

CGCGACGCCA TCTCCGAGCT TTTGCAGCGG GAGGCGGGCC TCCAGGAGAT 14 00 

3 5 CGTCCAGCTC GTGGGGCCGG ACGCCCTCCA GGACGCCGAG CGCCTCGTCA 14 50 

TTGAGGTGGG CCGGATCATC CGCGAGGACT TCCTGCAGCA GAACGCCTAC 1500 

CACGAGGTGG ACGCCTACTG CTCCATGAAG AAGGCCTACG GGATCATGAA 1550 

GATGATCCTC GCCTTCTACA AGGAGGCGGA GGCGGCCATC AAGCGGGGGG 1600 

TTTCCATAGA CGAGATCCTG CAGCTCCCCG TTCTGGAGCG CATCGGCCGC 1650 

40 GCCCGCTACG TGAGCGAGGA GGAGTTCCCC GCCTACTTTG AGGAGGCCAT 1700 

GAAGGAGATC CAGGGGGCCT TCAAGGCTGG CCTAAAGGGG GAGAGATGGA 1750 

CCTTCTGA 1758 



45 



55 



60 



2) INFORMATION FOR SEQ ID NO: 694 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694 
CGGCGCIATC YTSGTTGTTG C 21 
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2) INFORMATION FOR SEQ ID NO: 695 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695 
GTTTCACGTG ATGACGTACA 2 0 

15 



2 ) INFORMATION FOR SEQ ID NO: 696 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696 
30 AT I GG I C AYR TIGAYCAYGG IAARAC 26 



35 



2) INFORMATION FOR SEQ ID NO: 697 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697 

45 

CCIACIGTIC KICCRCCYTC RCG 23 



50 2) INFORMATION FOR SEQ ID NO: 698 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) . TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: extracted from J01690 



10 



15 



20 



25 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698 



GTGTCTAAAG 
TATCGGCCAC 
CCGTACTGGC 
GATAACGCGC 
CGTTGAATAC 
GGCACGCCGA 
GGCGCGATCC 
TGAGCACATC 
TCCTGAACAA 
GAAATGGAAG 
CACTCCGATC 
AGTGGGAAGC 
CCGGAACCAG 
CGTATTCTCC 
GCGGTATCAT 
ACTCAGAAGT 
CGAAGGCCGT 
GTGAAGAAAT 
CCGCACACCA 
CGGCCGTCAT 
GTACTACTGA 
GTAATGCCGG 
CGCGATGGAC 
TTGGCGCGGG 



AAAAATTTGA 
GTTGACCACG 
TAAAACCTAC 
CGGAAGAAAA 
GACACCCCGA 
CTATGTTAAA 
TGGTAGTTGC 
CTGCTGGGTC 
ATGCGACATG 
TTCGTGAACT 
GTTCGTGGTT 
GAAAATCCTG 
AGCGTGCGAT 
ATCTCCGGTC 
CAAAGTTGGT 
CT AC CTGTAC 
GCTGGTGAGA 
CGAACGTGGT 
AGTTCGAATC 
ACTCCGTTCT 
CGTGACTGGT 
GCGACAACAT 
GACGGTCTGC 
CGTTGTTGCT 



ACGTACAAAA 
GTAAAACTAC 
GGCGGTGCTG 
AGCTCGTGGT 
CCCGTCACTA 
AACATGATCA 
TGCGACTGAC 
GTCAGGTAGG 
GTTGATGACG 
TCTGTCTCAG 
CTGCTCTGAA 
GAACTGGCTG 
TGACAAGCCG 
GTGGTACCGT 
GAAGAAGTTG 
TGGCGTTGAA 
ACGTAGGTGT 
CAGGTACTGG 
TGAAGTGTAC 
TCAAAGGCTA 
ACCATCGAAC 
CAAAATGGTT 
GTTTCGCAAT 
AAAGTTCTGG 



CCGCACGTTA 
TCTGACCGCT 
CTCGTGCATT 
ATCACCATCA 
CGCACACGTA 
CCGGTGCTGC 
GGCCCGATGC 
CGTTCCGTAC 
AAGAGCTGCT 
TACGACTTCC 
AGCGCTGGAA 
GCTTCCTGGA 
TTCCTGCTGC 
TGTTACCGGT 
AAATCGTTGG 
ATGTTCCGCA 
TCTGCTGCGT 
CTAAGCCGGG 
ATTCTGTCCA 
CCGTCCGCAG 
TGCCGGAAGG 
GTTACCCTGA 
CCGTGAAGGC 
GCTAA 



ACGTTGGTAC 50 

GCAATCACCA 100 

CGACCAGATC 150 

ACACTTCTCA 2 00 

GACTGCCCGG 250 

TCAGATGGAC 3 00 

CGCAGACTCG 350 

ATCATCGTGT 4 00 

GGAACTGGTT 4 50 

CGGGCGACGA 500 

GGCGACGCAG 550 
TTCTTATATT 600 
CGATCGAAGA 650 
CGTGTAGAAC 700 
TATCAAAGAG 750 
AACTGCTGGA 800 
GGTATCAAAC 850 
CAC CAT C AAG 900 
AAGATGAAGG 950 

TTCTACTTCC 1000 

CGTAGAGATG 1050 

TCCACCCGAT 110 0 

GGCCGTACCG 1150 
1185 



35 



40 



45 



2 ) INFORMATION FOR SEQ ID NO: 69 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699 
GTI AC I GGYT CYTYRARRTT I CC I CC 



26 



50 



2) INFORMATION FOR SEQ ID NO: 7 00 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 700 

60 
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PCT/CA00/01150 

24 



5 2) INFORMATION FOR SEQ ID NO: 701 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 01 

GTGTTCACGA TCATCGATGC G 21 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 702 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702 
CTCTCGATAT CCGCGAAGCG 20 

2) INFORMATION FOR SEQ ID NO: 703 



(i) SEQUENCE CHARACTERISTICS: 
4 0 (A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 703 
TATGGAAATT CGAAACATCT 2 0 

50 



2) INFORMATION FOR SEQ ID NO: 704 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 

400 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 704 
AGTGCTCCAA TTAATGTTGG 



PCT/CA00/01150 



20 



10 



15 



20 



2 ) INFORMATION FOR SEQ ID NO: 705 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 705 
GTACAGTTCC AATACCTGAA 



20 



25 2 ) INFORMATION FOR SEQ ID NO: 706 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



30 



35 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706 
TGAAATCTTC ACATCCAACA 



20 



40 



45 



50 



55 



60 



2 ) INFORMATION FOR SEQ ID NO: 707 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707 
TWACCATTTC AGTACCTTCT GGTAA 



2 ) INFORMATION FOR SEQ ID NO: 708 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 

401 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708 
TCRTCCATIC CIARIATIGC IATIAT 2 6 

10 



2) INFORMATION FOR SEQ ID NO: 709 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Borrelia burgdorferi 
25 (C) ACCESSION NUMBER: extracted from AE001122 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709 

ATGAATGAAG TTTTATTTGT AAAGACTGCT GGTAGGAATT TAAAAGCAGA 50 

30 AGTAATTCGT ATTAGGGGCA ATGAAGTTGA TGCACAGGTT TTTGAATTGA 100 

CAAAAGGGAT ATCTGTTGGA GACCTAGTTG AATTTACAGA CAAACTTTTA 15 0 

ACAGTTGAAC TCGGACCAGG GCTTTTAACT CAAGTATATG ATGGGCTTCA 200 

AAATCCTTTG CCTGAATTGG CTATTCAATG TGGATTTTTT TTAGAAAGGG 250 

GAGTATATTT AAGGCCCTTG AATAAAGATA AAAAGTGGAA TTTTAAAAAA 3 00 

35 ACCTCCAAAG TTGGAGATAT CGTTATTGCA GGAGATTTTT TAGGTTTTGT 3 50 

AATTGAGGGA ACTGTTCACC ATCAAATAAT GATTCCATTT TATAAAAGGG 40 0 

ATTCTTATAA AATTGTGGAG ATTGTAAGTG ATGGCGACTA TTCGATTGAT 450 

GAGCAAATTG CTGTAATTGA AGATGATTCT GGTATGAGGC ATAATATTAC 500 

AATGTCTTTT CATTGGCCTG TTAAAGTTCC TATTACTAAT TATAAGGAAC 550 

40 GCCTTATTCC TAGTGAACCT ATGTTGACTC AAACTAGAAT TATAGATACA 60 0 

TTTTTCCCAG TTGCCAAAGG TGGAACTTTT TGCATTCCGG GTCCTTTTGG 650 

AGCAGGAAAA ACGGTTCTTC AGCAGGTTAC AAGTCGAAAT GCTGATGTTG 700 

ATGTAGTGAT TATTGCAGCT TGTGGTGAGC GAGCAGGAGA AGTGGTAGAA 750 

ACTCTTAAAG AATTTCCCGA ATTAATGGAT CCAAAAACCG GCAAATCTTT 800 

45 AATGGACAGG ACTTGTATTA TTTGTAATAC ATCTTCAATG CCAGTTGCAG 850 

CTAGAGAAGC TTCTGTTTAT ACTGCTATTA CTATTGGTGA GTATTACAGG 900 

CAAATGGGCC TTGATATTCT TCTTTTGGCA GATTCAACTT CAAGATGGGC 950 

TCAAGCAATG AGAGAAATGT CTGGACGCCT TGAGGAAATT CCTGGCGAGG 10 0 0 

AGGCTTTTCC GGCATATCTT GAGTCTGTTA TTGCTTCCTT TTATGAAAGG 1050 

50 GCAGGTATTG TAGTTCTTAA TAATGGGGAT ATTGGATCTG TAACAGTTGG 110 0 

TGGCTCTGTA AGTCCTGCTG GTGGTAATTT TGAAGAGCCA GTTACTCAAG 1150 

CAACTTTAAA AGTTGTAGGA GCATTTCACG GGCTTACAAG AGAAAGGTCT 12 00 

GATGCTAGGA AATTTCCAGC TATTAGTCCT CTTGAATCTT GGAGTAAATA 1250 

TAAAGGCGTT ATTGATCAAA AAAAGACTGA ATATGCAAGA TCTTTTTTGG 13 0 0 

55 TGAAAGGTAA TGAAATTAAT CAAATGATGA AAGTTGTTGG AGAAGAAGGC 1350 

ATAAGTAACG ATGATTTTTT AATTTATTTA AAATCCGAGC TACTTGATTC 14 00 

GTGCTATTTG CAGCAAAATT CATTTGATTC TATTGATGCT GCTGTTAGTT 1450 

CAGAGCGTCA AAATTATATG TTTGATATAG TTTATAACAT TCTTAAAACT 1500 

AACTTTGAGT TTTCTGATAA ACTTCAAGCA AGAGATTTTA TAAATGAGTT 1550 

60 AAGGCAAAAT CTTTTAGACA TGAATCTTTC TTCTTTTAAG GATCATAAGT 1600 

402 
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TTAATAAATT GGAGCATGCT TTGGGTGAAT TGATAAATTT TAAAAAGbTA 1650 
ATTTAG 1656 



2 ) INFORMATION FOR SEQ ID NO: 710 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1818 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Treponema pallidum 

(B) STRAIN: Nichols 

(C) ACCESSION NUMBER: extracted from AE000520 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710 



GTGATCAAAG ACGATGTGGT TACAGGCCGT GTAGTGAGGG TGTCTGGTCC 5 0 

CATTGTGTAT GCCGAGGGCC TCTCTGCGTG CAGCGTATAC GATGTTGTCG 10 0 

2 5 ACGTAGGGGA AGCATCGCTC ATCGGAGAAA TTATC CGGTT GGATGAGAGC 150 

AAGGCGGTCG TGCAAGTATA CGAGGATGAC ACAGGTATGC GAGTCGGGGA 2 00 

GAAGGTGACA AGCTTGCGTC GACCACTCTC AGTCCGCTTA GGGCCTGGAT 2 50 

TAATCGGCAC CATTTATGAC GGTATTCAGC GCCCACTTGA GCGCCTCTTC 3 00 

CAAGAAGACG GCGCCTTCTT GCGTCCTGGT GCGCGTTCAC AACCGCTTGA 3 50 

3 0 TGGCTCCGTA CGCTGGGATT TTCGTCCTCA TTGTAACGAG CGCGGTGAGG 4 00 

CCCTGTGCGC GGGGATTCCG ATTGCAC CTG GGTCAGTGTT AGGGACCGTG 4 50 

CAGGAGACTC CTTCTGTTGT GCACACTATC ATGGTTCCTC CTGACATCCG 500 
GGGGAGCGTG CTATCTTCGT TCAAGGGCGC AGGTGCTTAC ACAATAGATG 550 
AAGAAATTGG ACGCACTGAT CTTGGTGAGC CGCTTTTTCT ATCCCAGTAC 600 
35 TGGCCAGTGC GTCGTGCGCG TCCTTTCAGC AAAAAACTTG CAGTGTGTGA 650 
GCCACTAGTT ACTGGACAGC GGGCGATTGA TGTTTTCTTC CCCCTATCAA 700 
AGGGAGGAAC GGCGGCTATT CCAGGGGGAT TTGGAACTGG GAAGACAATG 750 
ACGCAGCATG CCGTTGCCAA GTGGTGTGAT GCAGATATTA TCGTGTACAT 800 
CGGCTGCGGA GAGCGGGGCA ACGAGATGAC AGACGTGCTC TCTGAATTTC 850 

4 0 CCAAACTCAT CGATCCGCGC ACAGGACGCT CTCTTATGGA GCGGACGATT 900 

TTGATCGCAA ATACGTCCAA TATGCCTGTG TCCGCACGCG AGGTGTCGCT 950 

GTATTCAGGG ATTACCCTTG CGGAATACTA CCGTGATATG GGTATGCATG 1000 

TGGCCATCAT GGCTGATTCT ACCAGCCGCT GGGCGGAGGC GCTGCGTGAA 10 50 

TTGTCTGGGC GCATGGAAGA AATGCCTGCG GAGGAGGGAT TCCCTGCGTA 1100 

45 CCTTCCGACG CGTCTTGCAG AATTTTATGA GCGCGCAGGA CGCGTGGAAA 1150 

CCTGTGTGGC GCGCGAGGGC TCTGTGAGCA TCATTGGTGC TGTTTCTCCC 12 00 

CTGGGTGGAG ATTTCTCTGA GCCGGTGACG CAGCACACAA AGCGCTTCAT 12 50 

CCGTTGCTTT TGGGCCTTGG ATCGTGAACT TGCACACGCG CGTCATTACC 13 00 

CTGCCATTGG GTGGATAGAT TCATACTCTG AATATGCGCA GGAAGTAAGT 13 50 

50 GCATGGTGGA GTAAGTATGA CCCGCGCGCA GGCGCGTTGC GCGCCGCAGC 14 00 

CTTGGATTTG CTGAGAAAGG AACAGCGGTT ACAGCAAATT GTCAGGCTTG 14 50 

TCGGTCCTGA TGCGCTGCCT GGAGAAGATC GTCTGGTGCT AATGGTGTGT 1500 

GAAATGATCA AAGGTGGCTT TCTGCAGCAG AACGCTTTTG ATCCGACGGA 1550 

TGTGTTCTCC TGTCCCGAAA AGCAGGTGCA GATCTTGCGT ACCATAGTGG 1600 

55 ATTTTCACGA ACGTGCCGTG GTGCTGCTGC GTGCAGGTAT TTCGCTTTCT 1650 

GCGCTGTCCC AGCTTTCGTG CCGGGAGCTC ATCGTACGTA TGAAAACTAC 1700 

GTACGGGAAT GAGGATGTAC ACAAGATGCA GAAAGTGTAC GACACGATGT 1750 

GCACTGAGTT TGACCAACTG AGTGTGTGTG CTGCCGCGCG CACACAAGGG 18 00 

GGGGAGAAAG TCGAATGA 1818 



60 



403 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 711 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1779 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia, trachomatii 
15 (B) STRAIN: MoPn 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711 

CAGGGCTATG TCGTAGAAGC TTACGGAAAT TTATTGCGGG TGCATTTTGA 50 

2 0 TGGGCATGTG CGTCAAGGAG AAGTGGCCTA TGTCAGCGTG GATGATACTT 100 

GGTTGAAAGC GGAAATTATA GAAGTTGTGG GAGATGAGGT TAAAGTCCAA 150 

GTTTTTGAGG AAACTCAAGG AATTTCTCGA GGCGCTTTGG TAACTTTTTC 2 00 

CGGGCATTTA TTAGAAGCGG AACTTGGGCC CGGTCTATTG CAAGGTATTT 250 

TTGACGGACT TCAGAATCGC TTAGAGGTAT TGGCAGATAC AAGCTTGTTC 300 

2 5 TTGAAAAGAG GGGAGTATGT TAATGCCATT TGTCGGGAAA CTGTATGGGC 350 

TTATACGCAA AAGGCTTCTG TCGGGGATGT TCTATCTCGG GGAGATGTGC 4 00 

TTGGTACAGT AAAGGAAGGG CGGTTTGATC ATAAAATCAT GGTTCCTTTC 450 

TCTTGTTTTG AGGAAGTGAC TATCACTTGG GTCATTTCTT CAGGAGATTA 500 

CACTGTTGAT ACCGTTATTG CTAAAGGACG TACTGCTTCA GGAGCCGAGC 550 

3 0 TTGAATTTAC AATGGTTCAG AAATGGCCCA TTAAACAGGC TTTTTTAGAA 600 

GGGGAAAAGG TACCGTCTCA TGAAATTATG GATGTTGGGT TACGAGTATT 650 
AGATACTCAG ATCCCCGTCT TAAAGGGAGG AACTTTTTGT ACTCCAGGGC 700 
CTTTTGGTGC AGGAAAGACC GTTTTACAGC ACCATTTATC TAAGTATGCA 750 
GCTGTAGATA TCGTAGTTTT GTGTGCTTGT GGAGAGCGAG CTGGAGAGGT 800 

3 5 TGTAGAAATT CTTCAGGAGT TCCCGCATTT GACAGATCCT CATACGGGGC 850 
AGTCTTTGAT GCATAGGACC TGTATTATTT GTAATACATC TTCCATGCCT 900 
GTAGCAGCTA GAGAGTCCTC CATTTATTTG GGTATTACTA TAGCAGAATA 950 

TTACCGTCAA ATGGGGTTGC ATGTTTTGTT ATTGGCTGAC TCGACATCTA 1000 

GATGGGCTCA AGCTTTAAGG GAAATTTCAG GGCGATTAGA AGAAATCCCT 1050 

40 GGAGAAGAAG CTTTCCCAGC CTATTTGGCG TCTCGAATAG CAGCTTTTTA 1100 

TGAGCGAGGC GGGGCTGTGA AAATGAAAGA TGGATCGGAA GGCTCCTTGA 1150 

CTATCTGTGG AGCGGTTTCT CCCGCAGGAG GAAATTTTGA AGAGCCTGTT 12 00 

ACACAAGCAA CTTTATCTGT TGTTGGGGCT TTCTGTGGGC TTTCTAAGGC 12 50 

TAGAGCAGAT GCTAGACGGT ATCCTTCTAT TGATCCGATG ATTTCATGGT 13 00 

45 CTAAGTACTT GGATTCTGTG GCGGAGATTT TAGAGAAAAA AGTTCCAGGA 1350 

TGGGGAGATT CCGTTAAAAA AGCTTCTCGT TTCTTAGAAG AAGGAGCAGA 14 00 

AATTGGTAAG CGAATAGAAG TTGTTGGGGA AGAAGGGATT TCTATGGAAG 14 50 

ATATAGAAAT CTTTTTGAAA TCAGAGTTGT ATGATTTCTG TTACTTACAG 1500 

CAAAACGCTT TCGATGCAGA GGACTGTTAT TGTCCTTTTG ATCGTCAAAT 1550 

50 AGAGCTTTTT TCTTTAATGA GTCATATTTT TAGCTCTAGA TTCTGTTTTG 1600 

ATTGTCCAGA TAATGCTCGG AGTTTCTTTT TAGAGCTTCA AAGTAAAATT 1650 

AAAACGCTGA ATGGTCAAAA ATTCCTTTCT GAAGACTATC AGAAGGGGCT 17 00 

AGAAGTGATC TATAAACTAT TAGAAAGCAA AATGGTGCAG ACGGCGTAGG 1750 

TATGCAAACA ATATATACAA GAATTACGG 177 9 

55 



2) INFORMATION FOR SEQ ID NO: 712 

60 (i) SEQUENCE CHARACTERISTICS: 

404 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

(A) LENGTH: 965 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 
10 (B) STRAIN: V583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712 

GTGCAAATTG GAAAAATTGT CAAAGTTTCA GGTCCTTTGA TTTTAGCTGA 50 

15 AAACATGTCA GATGCTAGTA TCCAAGACAT TTGTCATGTA GGAGATTTAG 100 

GCGTTATCGG AGAGATTATT GAAATGCGAG GCGACGTCGC TTCGATTCAA 150 

GTATATGAAG AAACAACAGG CATTGGACCA GGAGAAC C AG TTATTTCAAC 200 

AGGAGAACCA TTATCTGTTG AATTAGCCCC AGGTTTAATT GCCGAAATGT 2 50 

TTGATGGTAT TCAACGACCA TTGGATACAT TTCAAGAAGT AACCCACAGT 3 00 

20 AACTTTTTAG GCCGTGGCGT TAAAATTGAT GCGTTAGATC GTGAGAAAAA 3 50 

ATGGACGTTT GAACCAACTG TGGCAGTTGG TGAAGAAGTG TCGGCAGGTG 400 

ACATCGTCGG TGTGGTTCAA GAAACACCGA TTATTCAACA TAAAATTATG 450 

GTGCCTTTCG GCGTTTCAGG AACGATTGC C GAAATTAAAG CAGGTGACTT 500 

TGCCATTGAT GAAACAGTTT ACTCAGTGGA AACGGCTAAA GGAACGGAAA 550 

25 GTTTTAGCAT GATGCAAAAA TGGCCCGTTC GGCGGGGACG TCCCATTTTA 60 0 

GAAAAACTAA GTCCCAAAGT ACCGATGGTG ACCGGACAAC GCGTAATTGA 650 

TACCTTTTTC CCAATTACGA AAGGCGGAGC GGCAGCAGTT CCAGGACCAT 70 0 

TTGGCGCTGG AAAAACAGTC GTTCAGCACC AAATTGCTAA GTGGGCCGAT 750 

GTCGACTTAG TCGTTTACGT TGGTTGTGGG GAACGCGGGA ATGAAATGAC 800 

3 0 AGATGTTTTA AATGAATTTC CAGAATTAAT TGACCCAACA ACTGGTGAGT 8 50 

CTTTGATGAA TCGGACGATT TTAATTGCGA ATACGTCAAA TATGCCGGTA 90 0 

GCGGCACGGG AAGCCTCGAT TTATACAGGG ATTACCATTG CAGAATATTT 950 

CCGTGATATG GGTTA 965 



35 



50 



2) INFORMATION FOR SEQ ID NO: 713 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1737 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina barkeri 

(C) ACCESSION NUMBER: extracted from J04836 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713 



GTGGAAGTAA AAGGTGAAAT TTATCGTGTG TCTGGGCCTG TCGTCACCGC 50 

CATCGGCTTG CAGGCAAAAA TGTATGACCT GGTCAAAGTC GGTAATGAAG 100 

55 GTTTAATGGG TGAAGTCATT CAGATATTAG GGCCCAAGAC CATCATCCAG 150 

GTATATGAAG AGACCGCAGG TATCAAGCCA GGGGAACCCT GTGTATCTAC 200 

AGGGTCGTCT CTGTCCGTAG AACTTGGTCC GGGTCTTCTT TCCAGTATTT 250 

ATGACGGGGT TCAAAGGCCT CTGCACGTCC TGCTTGAAAA AATGGGTAGC 300 

TTCATCCAGA GAGGTGTCAG CGCAGATGGG CTTGATCATA AGAAACTCTG 3 50 

60 GGATTTCAAA CCCATTGTCA AGAAGGGCGA TTCCGTAAAA GGTGGAGACG 4 00 

405 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TAATTGGTGT TGTACAGGAA ACCGTGAATA 
CCTCCTGATA TCTCAGGTAC AATTTC CGAC 
GGTAGTAGAC ACAATCTGTA CTCTGACTGA 
TGCAGAGGTG GCCTGTTCGA AGACCCAGAC 
5 CCAACCAGGC CTCTGGTTAC AGGAATGAGA 
TGTGGCAAAA GGCGGAACAG CTGCAATCCC 
AGAC CGTAAC TCAGCAGTCG CTTGCAAAAT 
GTCTACATCG GTTGTGGTGA GCGTGGAAAC 
CGAATTCCCT GAACTCGAAG ATCCGCAGAC 

10 GTACTGTTCT TATCGCTAAC ACTTCAAACA 
GCATCTGTGT ATACCGGAAT CACCATTGCA 
ATTAGATGTA TCCCTTATGG CAGACTCCAC 
TGAGAGAAAT CTCTTCCCGT CTGGAAGAAA 
CCAGCATACC TGTCTGCAAG ACTGGCCGAA 

15 TGCGGAGAGT CTTTGCGGCG AAACAGGTTC 
TATCTCCACC TGGCGGTGAC TTCTCAGAGC 
CGTATCGTAA AAGTGTTCTG GGCTCTCGAT 
TCACTTCCCG GCCATCAACT GGCTGAACAG 
GTCTTAATGA CTGGTTTGCA GATAATGTGG 

2 0 AGGGAAAGAG CAATGGAAAT GCTCCAGACA 
CGTGCAGCTT GTAGGTTCCG ATGCTCTGCC 
TTGAAATCAC CCGTATGCTT AGGGAAATTT 
CACCCAGTAG ATGCATACAG CCCGTTCGAT 
GGCAATCATG AAATGGGGAG ACGCTGCGAT 

25 TTCCCGTAAC TGAAATTATC AAGCTTGAAT 
GTCAAGTACG AAGAGAAGTT TGATGAGTCT 
GATGGATAAA GAGTTTGCAT CCCTGAGAGG 



PCT/CA00/01150 

TTGAACATAA GATC^TGGTG ~45b 

ATAAAGAGCG GAAACTTTAC 5X)0 

TGGGACCGAA TTGCAGATGA 550 

CTGTGAAGGC AAAACTTACT 600 

ATCCTTGATG GGCTTTTCCC 650 

CGGACCTTTC GGATCGGGAA 700 

GGAGTGATAC CGAAATTGTG 750 

GAAATGGCAG ATGTTCTGAG 800 

CGGGCGCCCA CTTATGGAGC 850 

TGCCTGTGGC CGCAAGAGAA 900 

GAATACTACC GTGACATGGG 950 

CTCAAGGTGG GCAGAAGCCA 1000 

TGCCTGGTGA AGAAGGTTAC 1050 

TTCTACGAGC GTGCCGGGGT 1100 

CATTACTGTT ATTGGAGCAG 1150 

CTGTTACACA GAATACCCTG 1200 

GCCAAACTAT CTCAGAGGCG 12 50 

TTACAGTCTG TATAAGGACA 1300 

CTCCTGATTA TGTGCCTTTG 1350 

GAATCTGAAC TGCAGGAAAT 1400 

AGACGACCAG CAGCTTCTGC 1450 

TCCTGCAGCA GAATGCATTC 1500 

CAGCAGTACA AGATCCTTAA 1550 

GGATGCCTTG AAATCAGGTG 160 0 

CCAAAAATGT GCTTGCTAAG 1650 

ATGAATGCTG TCCTGGCACA 1700 

TAGGTAA 1737 



30 



40 



2) INFORMATION FOR SEQ ID NO: 714 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1785 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Methanosarcina jannaschii 

(C) ACCESSION NUMBER: extracted from U674 77 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714 

ATGAAAAGAG AGGTTGAGAA TATGCCAGTT GTTGGTAAGA TTATTAAAAT 50 

CGCAGGGCCT GTTGTAGTTG CAGAGGGAAT GAAAGGAGCT CAGATGTATG 100 

AGGTCGTTAA AGTAGGAGAA GAGAAATTGA CTGGAGAAAT CATTCAGTTG 150 

50 CACGATGATA AAGCAGTTAT TCAGGTTTAT GAAGAAACAT CTGGAATTAA 2 00 

ACCAGGAGAG CCAGTTGTTG GTACTGGAGC TCCATTGTCT GTTGAATTAG 250 

GGCCAGGGAT GTTAAGAGCT ATGTATGATG GTATTCAGAG GCCTTTAACA 3 00 

GCAATTGAAG AGAAAACAGG TTCAATCTTT ATCCCAAGAG GAGTTGATGT 3 50 

CCCTGCATTA CCAAGAGATA TAAAATGGGA ATTTAAACCA GTGGTAAATG 4 00 

55 AAGGAGATTA TGTTGAAGAA GGAGACATAA TTGGAACTGT TGATGAAACT 450 

CCTTCAATAG TTCATAAAAT CTTAGTTCCA ATTGGTGTTA AAGGAAAAAT 500 

TGTTGAAATA AAAGAGGGTA AATTTACAGT TGAAGAGACA GTTGCAGTTG 550 

TAGAAACAGA AAATGGAGAA AGGAAAGAAA TTACAATGAT GCAAAAATGG 600 

CCAGTAAGAA AACCAAGACC ATATAAAGAG AAACTACCTC CAGAAATTCC 650 

60 ATTAATTACA GGGCAAAGAG TTGAAGACAC TTTCTTTACA TTAGCAAAAG 700 

406 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GAGGAACAGC 
CAGCATCAGT 
ATGTGGAGAA 
ACTTAGAAGA 
5 ATAGCCAACA 
TACAGGAATT 
TATTAACAGC 
TCAGGTAGAT 
AGCTTCAAGA 

10 TAGGGAAAGA 
CCACCAGGAG 
AGTTAAGGTA 
TCCCAGCTAT 
ACAGAGTGGT 

15 TGAAGCAATG 
AGTTAGTTGG 
GTTGCAAGAA 
GGTAGATACC 
TTATGACATT 

2 0 CCAGCTAAGA 
ATACATCCCA 
AGAAAATTAA 



AGCAATTCCA 
TGGCAAAGTG 
AGAGGAAACG 
TATTAGAACT 
CATCAAACAT 
ACAATTGCAG 
AGATTCAACA 
TGGAAGAAAT 
TTGGCTCAGT 
TAACAGACAA 
GGGACTTCTC 
TTCTGGGCGT 
CAACTGGTTG 
GGAACACAAA 
AGCTTATTAC 
GCCTGATGCA 
TGTTGAGGGA 
TACTGTCCTC 
CTACCAAGAA 
TTTTAGGAGT 
CACGATGAGT 
GAATGAATTA 



GGTCCATTCG 
GTCTGACGCT 
AGATGACAGA 
GGAAACAAAT 
GCCTGTCGCT 
AGTACTTCAG 
TCAAGATGGG 
GCCAGGGGAA 
TCTATGAAAG 
GGATTCGTTT 
AGAACCAGTT 
TAGATGCAAA 
CAGAGTTATT 
TACTGGTCCA 
AAAAAGAGGC 
TTGCCAGATA 
GGATTTCTTA 
CAATGAAACA 
GCATTGAAGG 
TTCAGTTAAG 
TTATAAATGT 
GGTTCATTAA 



GTTCAGGAAA 
GATGTCGTTG 
GGTTATTGAA 
TAATGGATAG 
GCAAGGGAAG 
AGATATGGGT 
CAGAGGCAAT 
GAAGGGTATC 
AGCTGGAAGA 
GTATCGTTGG 
ACATCAAACA 
CTTGGCAAGA 
CATTATACAT 
GATTGGAGAC 
AGAGTTGCAA 
GGGAGAGAGT 
CAGCAAGATG 
GTACTTAATG 
CAGTTGAAAG 
CAAGATATTG 
TAAATCAAAA 
ACTAA 



PCT/CAOO/01150 

AACGGTTACT ~7 50 

TTTATATCGG 800 

GAGTTCCCAC 850 

AACTGTATTA 90 0 

CATCTGTCTA 950 

TATGGAGTTT 100 0 

GAGAGAAATT 1050 

CAGCATACTT 1100 

GTTATAACCT 1150 

AGCTGTTTCA 12 0 0 

CACTAAGGAT 12 5 0 

AGAAGACACT 13 0 0 

TGATGATGTT 135 0 

AATTAAGAGA 14 00 

GAGATTGTTC 14 5 0 

TATTTTAGAA 150 0 

CGTTTGATGA 1550 

TTAAAGATAA 1600 

AGGAGTTGAA 1650 

CAAGAATGAA 1700 

GAAATAATGG 17 50 
1785 



25 



2) INFORMATION FOR SEQ ID NO: 715 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 13 54 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(B) STRAIN: W83 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715 

TGAGCGAGGT GATCAAGGTG ATCGGCAAAA ATGCTTATGT GCAGGTTTTC 50 

GAAAGTACTC GCGGTATGCA CGTAGGAGAT GAGGCAGAGT TTACCGGCAG 100 

TATGCTTGAG GTAACGCTCG GCCCCGGTAT GCTTTCGAAG AACTACGACG 150 

4 5 GTCTGCAACA CGACTTGGAC AAGATGGACG GGATCTTCCT CAAACGAGGC 2 00 

GATTATACTC CCGCTCTCGA TGACGACAAG CTGTGGGACT TCAAGCCTTT 250 

GGCCAATGTG AACGACAATG TGATCGCAGG CTCATGGCTC GGAGAGGTGA 3 00 

CGGAAAATTT CCAACCGCAC AAGATCATGG TACCTTTCGT TTTCGAAGGC 350 

AATTACAAGG TGAAGAGTCT GGCCAAAGCC GGTTCGTACA AAGTGAACGA 400 

50 TGTGATCGCT GTGGTAACGG ATCAGGACGG GAAAGACCAC AATGTAACCA 4 50 

TGGTGCAGAA ATGGCCGGTG AAACGTGCTA TCACTTGCTA TCGCGAGAAG 500 

CCGCGTCCTT TCAAACTGCT CGAAACGGGT ATCCGTATCA TCGACACTTT 550 

CAACCCCATC GTAGAGGGTG GTACGGGATT TATCCCCGGT CCTTTCGGTA 600 

CGGGAAAGAC GGTGCTCCAG CATGCTATCT CGAAGCAGGC GGAAGCCGAT 650 

55 ATCGTGATCA TTGCAGCCTG TGGCGAGCGT GCAAACGAGG TTGTGGAGAT 700 

CTTTGCGGAA TTCCCCCACC TGAATGACCC CCACACGGGA CGTAAATTGA 7 50 

TGGAACGTAC CATTATTATT GCTAATACGT CGAATATGCC TGTGGCTTCG 800 

CGTGAGGCAT CCGTATATAC GGCCATGACG ATAGCCGAGT ACTATCGCTC 850 

CATGGGCCTT CGCGTGCTGA TGATGGCAGA CTCCACTTCG CGTTGGGCAC 900 

60 AGGCTCTGCG TGAGATGTCT AACCGTCTGG AAGAGCTTCC CGGACCGGAT 950 

407 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GCTTTCCCGA TGGACTTGTC AGCTATCGTA 
AGGATACGTT TACCTGAACA ACGGTTCGGC 
GTACGGTATC TCCCGCCGGT GGTAACCTCA 
ACCAAGAAAG TGGCTCGCTG CTTCTATGCT 
CCGCAAACGT TATCCGGCTG TAAACCCCAT 
TCGAATATCC CGAATTCGAG AGCTATATAT 
ATTATATTCA ACAGAAGTTG GCAGATATAT 
CTGGACTACT AAGGTGAATG AGCTGAAGAT 
AAAT 



PCT/CA00/01150 

GCCAACTTCT ACG^TCGTGC 1/5 00 

CGGTTCGGTA ACGTTCATCG 1050 

AAGAGCCTGT GACGGAAAAC 1100 

TTGGAGCAGA ATCGTGCCGA 1150 

CGATAGTTAC TCGAAGTACA 12 00 

CGAACCACAT CAGTTTACTC 12 50 

GGATCAAATC TTACAGCAGA 1300 

GCGCTTGCAT CAGGGTAAAG 13 50 

1354 



2 ) INFORMATION FOR SEQ ID NO: 716 



15 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: Type 4 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716 



TTTGACTCAA 
3 0 CAGGTATGCA 
GGGTTAATCG 
AGTCTATGAA 
CTGGAGAACC 
TTTGATGGCA 

3 5 TGATTTTCTA 

AGTGG.CATTT 
GATATTCTTG 
GGTTCCTTAT 
TTACAATTGA 

4 0 TAT AAAGGAA 

TTCTAAACGT 
TTGATGCATT 
CCGTTTGGAG 
CAATGTTGAT 

45 TGACGGATGT 
CAATCAATTA 
TGTTGCTGCT 
ATTTTCGTGA 
CGTTGGGCAG 

50 TGGTGATGAG 
ATGAAAGAGC 
ACGATTACTG 
ACCAGTTACT 
ATGCTCCGTT 

55 TCTTATTCAC 
GAAGACAGAT 
GGGAATCTAG 
TCTGATAATG 
TTATTTGCAA 

60 CAAAACAAGA 



GGGAAGATTA 

GGAGGCTAAT 

GTGAAATTAT 

GAAACATCTG 

TCTCTCGGTT 

TACAACGCCC 

GTTCGTGGGG 

TGATTCCACT 

GAACTGTCAA 

GGAGTATCTG 

TGAAGTTGTA 

CGCTTATGCA 

TTAATTCCAG 

CTTTCCAGTA 

CAGGAAAGAC 

ATTGTTATTT 

ACTGAATGAG 

TGCAACGGAC 

CGTGAGGCTT 

TATGGGCTAC 

AAGCGCTACG 

GGTTATCCTG 

AGGACGTTCT 

CTATTGGAGC 

CAAAACACTT 

GGCACAGCGA 

TATATAAAGA 

TGGAATAGTA 

TTTAGAGGAA 

AACGACTAAC 

CAGAACGCTT 

AGCAATGCTA 



TAAAAGTATC 

ATTCAAGATA 

TGAAATGAGA 

GTCTTGGTCC 

GAATTAGGGC 

ATTAGATCGA 

TAGAAGTTCC 

ATAGCAATTG 

GGAAACCGAG 

GAGAAGTCGT 

TATGAAATAA 

AAAATGGCCT 

AAGAACCATT 

ACCAAAGGGG 

AGTTGTACAA 

ATGTCGGTTG 

TTTCCTGAGT 

AGTTCTGATT 

CAATTTATAC 

TCTGTCGCCA 

TGAAATGTCA 

CTTATCTGGG 

CAGGTTCTAG 

TGTATCGCCA 

TACGGATTGT 

CGTCATTTTC 

CAGTGTGGGC 

AAATAACTCG 

ATTGTTCGTC 

GATGGAAATT 

TTGATTCGGT 

AGTAATATTC 



GGGACCTCTA 

TTTGCCGTGT 

AGAGATCAGG 

GGGAGAACCT 

CAGGATTGAT 

TTTAAATTGG 

AAGTTTGGAT 

GTCAAAAAGT 

GTAGTTAATC 

TTCTATTGCA 

AAAAATTGGA 

GTCCGCAAGG 

AATCACAGGT 

GAGCTGCAGC 

CACCAAGTAG 

TGGAGAACGT 

TGATTGACCC 

GCTAATACTT 

AGGAATTACC 

TTATGGCTGA 

GGACGTCTAG 

AAGTCGTATC 

GGCTTCCAGA 

CCTGGTGGAG 

GAAAGTTTTT 

CTGCAATTAA 

ACTTATATAG 

TGCGATGAAC 

TTGTTGGAAT 

GCTAAACAAA 

AGATACATTC 

TCACTTTTGC 



GTTATTGCAT 5 0 

AGGTAAGCTA 10 0 

CATCTATCCA 15 0 

GTTGTTACAA 20 0 

TTCTCAAATG 2 50 

CTACTCATAA 3 00 

AGAGATATTA 350 

GAGTACGGGT 4 00 

ATAAAATTAT 4 50 

TCTGGCGATT 500 

CGGTAGTTTC 550 

CGCGTCCTGT 600 

CAACGAGTTA 650 

AGTTCCTGGA 700 

CTAAATTTGC 750 

GGAAATGAAA 800 

TAATACCGGA 850 

CAAATATGCC 900 

ATGGCTGAGT 950 

TTCAACTTCA 1000 

AAGAAATGCC 1050 

GCTGAATATT 1100 

ACGTGAAGGA 1150 

ATATTTCAGA 1200 

TGGGGGCTTG 1250 

CTGGCTTACA 1300 

ATGGTAAAGA 13 50 

TACTTACAAC 1400 

TGATTCTCTG 1450 

TTCGAGAAGA 1500 

ACTTCGTTTG 1550 

TGATCAGGCA 1600 



408 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AATCATGCTT TAGAGTTGGG TTCTTACTTT 

CGTGGCAGTT CGAGACCGTA TGGCGAGAAG 

GATTAGATGA AATCAAAATT ATATCAAATG 

TTGATATTAG AAACAGGAGG TCTATAAATG 



PCT/CA00/01150 

ACAGAGATTA TGG&AGGTAC 1^50 

TAAATATGTT TCAGAAGATA 1700 

AGATTACACA TCAAATTCAT 1750 

AGTGTTAT 1788 



2) INFORMATION FOR SEQ ID NO: 717 

10 <i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia mallei 
20 (B) STRAIN: GB8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717 

CGATCCTGGT GTGCTCGGCC GCTGACGGCC CGATGCCGCA AACGCGTGAG 50 

2 5 CACATCCTGC TGGCGCGTCA GGTCGGTGTG CCGTACATCA TCGTGTTCCT 100 

GAACAAGTGC GACATGGTGG ACGACGCGGA GCTGCTCGAG CTGGTCGAAA 150 

TGGAAGTGCG CGAACTGCTG TCGAAGTACG ACTTCCCGGG CGACGACACG 2 00 

CCGATCATCA AGGGTTCGGC GAAGCTGGCG CTGGAAGGCG ACAAGGGCGA 2 50 

GCTGGGCGAA GTGGCGATCA TGAACCTGGC CGACGCGCTG GACACGTACA 300 

30 TCCCGACGCC GGAGCGTGCG GTCGACGGCG CGTTCCTGAT GCCGGTGGAA 3 50 

GACGTGTTCT CGATCTCGGG CCGTGGTACG GTGGTGACGG GTCGTGTCGA 4 00 

GCGCGGCGTG ATCAAGGTTG GCGAGGAAAT CGAAATCGTC GGTATCAAGG 4 50 

CGACGGCGAA GACGACCTGC ACGGGCGTGG AAATGTTCCG CAAGCTGCTG 500 

GACCAGGGTC AGGCGGGCGA CAACGTCGGT ATCCTGCTGC GCGGCACAAA 550 

35 GCGTGAAGAC GTGGAGCGCG GCCAGGTTCT GGCGAAGCCG GGTTCGATCA 600 

CGCCGCACAC GCACTTCACG GCAGAAGTGT ACGTGCTGAG CAAGGACGAA 650 

GGCGGCCGCC ACACGCCGTT CTTCAACAAC TACCGTCCGC AGTTCTACTT 700 

CCGTACGACG GACGTGACGG GCTCGATCGA GCTGCCGAAG GACAAGGAAA 750 

TGGTGATGCC GGGCGACAAC GTGTCGATCA CGGTGAAGCT GATCGCGCCG 8 00 

40 ATCGCGATGG AAGAAGGTCT GCG 823 



2) INFORMATION FOR SEQ ID NO: 718 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 
("O STRANDEDNESS: Double 

50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 
55 (A) ORGANISM: Burkholderia pseudomallei 

(B) STRAIN: 1026B 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718 

60 GCGATCCTGG TGTGCTCGGC CGCTGACGGC CCGATGCCGC AAACGCGTGA 50 
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GCACATCCTG CTGGCGCGTC 
TGAACAAGTG CGACATGGTG 
ATGGAAGTGC GCGAACTGCT 
GCCGATCATC AAGGGTTCGG 
5 AGCTGGGCGA AGTGGCGATC 
ATCCCGACGC CGGAGCGTGC 
AGACGTGTTC TCGATCTCGG 
AGCGCGGCGT GATCAAGGTT 
GCGACGGCGA AGACGACCTG 

10 GGATCAGGGT CAGGCGGGCG 
AGCGTGAAGA CGTGGAGCGC 
ACGCCGCACA CGCACTTCAC 
AGGCGGCCGC CACACGCCGT 
TCCGTACGAC GGACGTGACG 

15 ATGGTGATGC CGGGCGACAA 
GATCGCGATG GAAGAAGGTC 



AGGTCGGTGT GCCGTACATC 
GACGACGCGG AGCTGCTCGA 
GTCGAAGTAC GACTTCCCGG 
CGAAGCTGGC GCTGGAAGGC 
ATGAACCTGG CCGACGCGCT 
GGTCGATGGC GCGTTCCTGA 
GCCGTGGTAC GGTGGTGACG 
GGCGAGGAAA TCGAAATCGT 
CACGGGCGTG GAAATGTTCC 
ACAACGTCGG TATCCTGCTG 
GGCCAGGTTC TGGCGAAGCC 
GGCTGAAGTG TACGTGCTGA 
TCTTCAACAA CTACCGTCCG 
GGCTCGATCG AGCTGCCGAA 
CGTGTCGATC ACGGTGAAGC 
TGCG 



PCT/CAOO/01150 

ATSGTGTTCC /LOO 

GCTGGTCGAA 150 

GCGACGACAC 2 00 

GACAAGGGCG 2 50 

GGACACGTAC 3 00 

TGCCGGTGGA 3 50 

GGTCGTGTCG 4 00 

CGGTATCAAG 4 50 

GCAAGCTGCT 500 

CGCGGCACGA 550 

GGGTTCGATC 600 

GCAAGGACGA 650 

CAGTTCTACT 700 

GGACAAGGAA 750 

TGATCGCGCC 800 

824 



20 2) INFORMATION FOR SEQ ID NO: 719 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 0 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium beijerincki (deposited as 
Clostridium butyricum) 

(B) STRAIN: ATCC 82 60 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719 

TGTATCAGCA GCAGATGGTC CAATGCCACA AACAAGAGAA CATATACTAC 50 

TAGGATCAAG AGTTGGTATC CAATATATCG TAGTATTCTT AAATAAAGCA 100 

GATATGGTAG ACGATCCAGA ATTATTAGAA TTAGTAGAAA TGGAAGTAAG 150 

4 0 AGAATTATTA AGCGAATATG ACTTCCCAGG AGACGATATT CCAGTAATAA 2 00 

CAGGATCAGC ATTAAAAGCA TTAGAAAATC CAACAGATGA AGAAGCAATT 2 50 

AAGCCAATCA TGGATTTAAT GGAAGCAGTA GATAGCTATA TCCCAACTCC 300 

AGAAAGAGCA ACAGATAAGC CATTCTTAAT GCCAATCGAA GATGTATTCA 3 50 

CAATTACAGG AAGAGGAACA GTTGCAACAG GAAGAGTTGA AGCTGGAGTA 400 

45 CTTCATGTAG GAGATGAAGT AGAAATCGTT GGATTAACAG AAGAAAAGAA 450 

GAAAGTTGTA GTAACTGGAA TCGAAATGTT CAGAAAGTTA TTGGATGAAG 500 

CACAAGCTGG AGATAACATC GGAGCATTAT TAAGAGGAGT TCAAAGAACT 550 

GATATTGAAA GAGGTCAAGT TTTATCAAAA CCAAATTCAG TACACCCTCA 600 

CACTAAATTT GTAGGTCAAG TATACGTACT TAAAAAAGAA GAAGGTGGAA 650 

50 GACATACTCC ATTCTTTGAT GGATACAGAC CACAATTCTA TTTCAGAACA 700 

ACAGACGTTA CAGGRTCAAT CAAGTTACCA GATGGAATGG AAATGGTAAT 750 

GCCTGGAGAT CACATTGATA TGAATGTTGA ATTAATCACT CCAATCGCAA 800 



55 



2) INFORMATION FOR SEQ ID NO: 72 0 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 799 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium innocuum 

(B) STRAIN: ATCC 14501 



10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 0 

GGTGCTATCC TGGTTGTTGC TGCATCTGAT GGTCCTATGC CTCAGACTCG 50 

TGAGC AC AT C CTGCTTGCTC GTCAGGTAGG TGTTCCTTAC ATCGTTGTAT 10 0 

TCCTGAACAA ATGCGACATG GTTGATGACG AAGAACTGAT CGACCTTGTT 150 

15 GAAATGGAAG TACGTGAGCT GTTAAGCGAG TACGGATTCG ACGGAGATAA 200 

CGCTCCGGTT ATCCGTGGTT CTGCACTGAA GGCTCTGGAA GGTGACGACA 250 

AATACGTTGG CGCTATCAAA GAACTGATGG ATGCAGTTGA TGAATTCATC 300 

CCAGATCCAA CTCGTGAAAC TGACAAACCA TTCCTGATGT CTGTAGAAGA 3 50 

CGTTATGACA ATCACAGGAC GTGGTACAGT TGCTACAGGA CGTGTTGAGC 400 

20 GTGGGGTAGT AAAACTGGGA GAAGAAGTTG AAATCGTTGG TATCAAGGAT 450 

ACTCAGAAAA CTGTTGTTAC CGGACTGGAA ATGTTCCGTA AGCAGCTGGA 500 

CTTCGCAGAA TC CGGAGACA ACATCGGTGC TCTGCTGCGT GGTATCAACC 550 

GTGACCAGAT TCAGCGTGGA CAGGTTCTTG CTAAACCAGG ATCCGTACAT 60 0 

CCACACACAA AGTTCAAGGC TCAGGTTTAT GTATTAACAA AAGAAGAAGG 650 

2 5 TGGACGTCAC ACTCCATTCG TTTCTAACTA CCGTCCTCAG TTCTACTTCC 700 

GTACAACTGA CGTAACTGGT GTTATTACAT TACCGGAAGG AACTGAAATG 750 

GTTATGCCTG GTGACAACGT TGAAATGAAC GTTGAGCTGA TTGCTCCAA 799 



30 



40 



2) INFORMATION FOR SEQ ID NO: 721 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 78 9 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 194 02 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721 

TTGTATCAGC AGCAGATGGT CCAATGCCAC AAACAAGAGA ACACATCCTA 50 

TTAGCATCAA GAGTAGGAGT TAAC C ACATA GTAGTATTCT TAAACAAAGC 100 

< AGACCAAGTA GATGATCCAG AATTACTAGA ATTAGTAGAA ATGGAAGTAA 150 

50 GAGAATTATT AAGCGAATAC GGATTTGACG GAGACGAATG TCCAGTAGTA 2 00 

GTAGGATCAG CATTAAAAGC AATCGAAGAA GGGGATGACC AATGCATCCT 2 50 

AGACTTAATG AAAGCTGTAG ATGAATATAT CCCAACTCCA GAAAGAGCAA 3 00 

CAGATCAACC ATTCTTAATG CCTGTAGAAG ATGTATTTAC AATTACAGGA 350 

AGAGGAACAG TTGCAACAGG AAGAGTTGAA AGAGGAGTAC TACACGTAGG 4 00 

55 AGATGAAGTA CAAATCGTAG GAATGAAAGA AGAAATCGGA AAGACAACAA 450 

TCACAGGAGT AGAAATGTTC AGAAAGATGT TAGATGAAGC AATGGCTGGA 50 0 

GATAACATCG GAGCATTATT AAGAGGAGTA CAAAGAGACG AAATCGAAAG 550 

AGGTCAAGTA CTAGCAAAAC CAGGTTCAGT AACACCTCAC AAAAAATTCG 60 0 

TAGGTCAAGT TTACGTATTA AAGAAAGAAG AAGGTGGAAG ACACACTCCA 65 0 

60. TTCTTTAACG GATACAGACC ACAATTCTAC TTCAGAACAA CAGACGTAAC 700 
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AGGATCAATC GCTTTACCAG AAGGAGTAGA AATGGTAATG CCAGGAGACC 7 5*> 

ATATAGACAT GAACGTAGAA TTAATCACAC CAGTAGCAA 78 9 



2) INFORMATION FOR SEQ ID NO: 722 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12464 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722 

GTTCAGCAGC AGACGGTCCA ATGCCACAAA CAAGAGAACA TATACTACTA 50 

GCATCAAGAG TTGGTGTTGA CTATATCGTA GTATTCTTAA ACAAGGCAGA 100 

TATGGTAGAT GACGAAGAAT TATTAGAATT AGTAGAAATG GAAGTTAGAG 150 

AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC AGTAATCAAG 2 00 

GGATCAGCTT TAGTAGCATT AGAAAACCCA ACAGATGAAA AATCAATCGC 250 

TCCAATCTTA GAATTAATGG AAGCAGTAGA TAGCTACATT CCAACACCAG 3 00 

AAAGAGCAAC AGATAAGCCA TTCTTAATGC CAGTAGAAGA TGTATTCACA 3 50 

ATAACTGGTA GAGGAACAGT TGCAACAGGA AGAGTTGAAA GAGGAGTTCT 400 

TCATGTAGGA GACGAAGTAG AAATCGTTGG ATTATCAGAA GAAAGCAGAA 4 50 

AAGTAGTAGT AACAGGAATA GAAATGTTCA GAAAGTTACT AGACGAAGCA 500 

CAAGCTGGAG ATAATGTTGG AGTACTTTTA AGAGGTGTTC AAAGAACAGA 550 

TATCGAAAGA GGTCAAGTAT TAGCAAAGAC TGGATCAGTT AAGCCACACA 600 

GCAAGTTCGT AGGTCAAGTA TACGTACTTA AGAAAGAAGA AGGTGGAAGA 650 

CATACTCCAT TCTTCGATGG ATACAGACCA CAATTCTACT TCAGAACAAC 7 00 

AGACGTTACT GGATCAATCA AATTACCAGA CGGAATGGAA ATGGTTATGC 7 50 

CAGGAGACCA CATTGATATG AACGTTGAAT TAATC AC TCA AGTAGCAA 798 



2) INFORMATION FOR SEQ ID NO: 723 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tertiu/n 

(B) STRAIN: ATCC 14573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 723 

GTTCAGCAGC AGATGGTCCA ATGCCTCAAA CAAGAGAACA CATACTACTA 50 

GCTTCAAGAG TTGGTGTTGA CTACATAGTT GTTTTCTTAA ACAAGGCAGA 100 

TATGGTAGAT GACGAAGAAT TATTAGAATT AGTTGAAATG GAAGTAAGAG 150 

AATTATTATC AGAATACAAC TTCCCAGGAG ATGATATTCC AGTAATAAAG 2 00 
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GGTTCAGCTT TACAAGCATT AGAAAACCCA 
TCCAATCCTT GAGTTAATGG AAGCTGTAGA 
AAAGAGCAAC AGATAAGCCA TTCTTAATGC 
ATCACTGGTA GAGGAACAGT TGCTACAGGA 
5 TCACGTAGGA GACGAAGTAG AAATCGTTGG 
AAGTAGTAGT AACAGGAATA GAAATGTTCA 
CAAGCTGGAG ACAACGTAGG AGTTCTTTTA 
CATCGAAAGA GGTCAAGTTT TAGCAAAAGT 
AGAAATTTGT AGGTCAAGTA TACGTACTTA 
10 CATACTCCAT TCTTCGATGG ATACAGACCA 
AGAYGTTACT GGTTCAATCA AGTTACCAGA 
CAGGAGACCA CATTGATATG AACGTTGAAT 



PCT/CA00/01150 

ACAGATGAAA i^AGCAATCGC 2 50 

TAGCTACATT CCAACTCCAG 300 

CAGTAGAAGA TGTATTCACA 350 

AGAGTTGAAA GAGGAGTTCT 400 

ATTATCAGAA GACAGCAGAA 4 50 

GAAAGTTACT AGACGAAGCG 500 

AGAGGAGTTC AAAGAACTGA 550 

TGGATCAGTT AAGCCACACA 600 

AAAAAGAAGA AGGTGGAAGA 650 

CAATTCTACT TCAGAACAAC 700 

TGGAATGGAA ATGGTTATGC 750 

TAATCACTCA AGTAGCTAT 799 



15 



25 



50 



2) INFORMATION FOR SEQ ID NO: 724 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 8 01 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetani 

(B) STRAIN: ATCC 19406 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724 

TAGTAAGTGC AGCAGATGGT CCAATGCCAC AAACAAGAGA ACACATACTA 50 

TTAGCATCCA GAGTTGGAGT TGAGCACATA GTAGTATTCT TAAATAAAGC 100 

AGACCAAGTA GATGACGCAG AGTTAATCGA ATTAGTAGAA ATGGAAGTAA 150 

35 GGGAATTAAT GAACGAATAC GGATTCCCAG GAGATGACGC ACCAGTAGTA 2 00 

GTAGGATCCG CATTAAAAGC ATTAGAAAAT CCAGAAGATG ATGCAGCAAC 2 50 

ACAATGCATA ATGGACTTAA TGGCAGCAGT AGATGAATAT ATACCAACAC 3 00 

CAGAAAGAGC AACAGATAAG CCATTCTTAA TGCCAGTAGA AGATATCTTC 3 50 

ACAATCACAG GAAGAGGAAC AGTTGCAACA GGAAGAGTAG AAAGAGGAAT 400 

40 TCTAAAAGTA GGAGACGAAA TAGAAATCGT AGGATTAAGT GATGAAAGCA 4 50 

AGAAATCAGT AATCACAGGA ATAGAAATGT TCAGAAAACT ATTAGATGAA 500 

GCACAAGCAG GAGATAACAT CGGAGCATTA TTAAGAGGTG TTCAAAGAGA 550 

TGAAATCCAA AGAGGTCAAG TATTAGCAGC AACAGGATCA GTAAAACCAC 600 

ATAAGAGTTT TACAGGTCAA GTATATGTAT TAAAGAAAGA AGAAGGAGGA 650 

4 5 AGACACACTC CATTCTTTAA CGGATACAGA CCACAATTCT ACTTTAGAAC 700 

AACAGACGTA ACAGGTTCAA TCGCACTACC AGAAGGAGTA GAAATGGTAA 750 

TGCCAGGAGA CCACATAGAC ATGAAGGTAG AATTAATAAC AAGAGTAGCA 800 

A 801 



2) INFORMATION FOR SEQ ID NO: 725 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 633 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 



(xi) SEQUENCE DESCRIPTION: SEQ 



ID NO: 725 



TATGCCTCAA 
CTTACATCGT 
TTACTAGAAT 
CTTCCCAGGC 
TAGAAGGCGA 
GTTGATGAAT 
GATGCCAGTC 
CTGGTCGTGT 
GTTGGTATTG 
GTTCCGTAAA 
TGTTACGTGG 
AAACCAGCTT 
TTTAACTAAA 



ACTCGTGAAC 
TGTATTCTTA 
TAGTTGAAAT 
GACGACACTC 
TGCTTCATAC 
ATATCCCAAC 
GAAGATGTAT 
TGAACGTGGA 
CTGAAGCAAC 
TTGTTAGATT 
TGTTGCACGT 
CAATCACTCC 
GAAGAAGGCG 



ACATCTTGTT 
AACAAAATGG 
GGAAGTTCGT 
CAGTTATCGC 
GAAGAAAAAA 
ACCAGTTCGT 
TCTCAATCAC 
CAAGTTCGCG 
TGCTAAAACA 
ACGCTGAAGC 
GAAGACATCC 
ACATACAAAA 
GACGTCATAC 



ATCTCGTAAC GTTGGTGTTC 
ATATGGTTGA TGATGAAGAA 
GACTTATTGT CAGAATACGA 
TGGTTCAGCT TTGAAAGCTT 
TCTTAGAATT AATGGCTGCT 
GATACTGACA AACCATTCAT 
TGGTCGTGGA ACTGTTGCAA 
TTGGTGACGA AGTTGAAATC 
ACTGTTACAG GTGTTGAAAT 
AGGCGATAAC ATTGGTGCAT 
AACGTGGACA AGTATTGGCT 
TTCTCTGCAG AAGTTTACGT 
TCC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
633 



2) INFORMATION FOR SEQ ID NO: 726 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus sulfureus 

(B) STRAIN: ATCC 49903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726 

CACGTGAACA CATCTTGTTA TCTCGTAACG TAGGTGTTCC TTACATCGTT 50 

GTATTCTTAA ACAAAATGGA TATGGTTGAT GACGAAGAAT TATTAGAATT 10 0 

AGTAGAAATG GAAGTTCGTG ACTTATTATC AGAATACGAT TTCCCAGGCG 150 

ATGACACTCC AGTTGTTGCA GGTTCTGCTT TGAAAGCTTT AGAAGGCGAC 200 

GCTTCTTACG AAGAAAAAAT CATGGAATTA ATGGCTGCAG TTGACGAGTA 250 

CATCCCAACT CCAACTCGTG ACACTGACAA ACCATTCATG ATGCCAGTTG 300 

AGGATGTATT CTCAATCACT GGACGTGGTA CTGTTGCTAC AGGTCGTGTT 3 50 

GAACGTGGAC AAGTTCGCGT TGGTGACGTT GTAGATATCG TTGGTATCGC 4 00 

TGACGAAACT GCTCAAACAA CTGTAACAGG TGTTGAAATG TTCCGTAAAT 450 

TATTAGACTA CGCTGAAGCA GGCGATAACA TCGGTGCTTT ATTACGTGGT 500 

GTTGCTCGTG AAGACATCCA ACGTGGACAA GTTTTAGCTA AACCAGCTTC 550 

AATCACTCCA CATACAAAAT TCTCTGCTGA AGTATACGTA TTAAGCAAAG 60 0 

AAGAAGGTGG ACGTCATACT CCA 623 



2) INFORMATION FOR SEQ ID NO: 727 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 6 bases 

(B) TYPE: Nucleic acid 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus garvieae 

(B) STRAIN: ATCC 49156 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727 

CGGTCCTATG CCTCAAACTC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 

GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT GGTTGATGAC 100 

GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 150 

15 ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCTGGT TCTGCTTTGA 2 00 

AAGCTCTTGA AGGAGATCCT TCATACGAAG AAAAAATCAT GGAATTGATG 2 50 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGATA CTGACAAACC 3 00 

ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 3 50 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGATGAAGTA 4 00 

20 GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 4 50 

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTCA AAGCTGAAGT 600 

TTATGTTTTG ACAAAAGAAG AAGGTGGACG TCACACTCCA TTCTTC 646 



25 



40 



2) INFORMATION FOR SEQ ID NO: 72 8 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 82 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma, pirum 

(B) STRAIN: ATCC 25960D 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 728 



GAGCAATTTT AGTTGTTTCT GCAACTGATG GTCCAATGCC TCAAACTCGT 50 

GAACATATCT TATTAGCACG CCAAGTTGGT GTTCCTAAAA TGGTTGTTTT 100 

45 CTTAAACAAA TGTGATGTTG CTTCTGATCC AGAAATGCAA GAATTAGTTG 150 

CTGAAGAAGT AAAAGACTTA TTAAAATCTT ATGGTTTTGA TGGTGACAAT 2 00 

ACTCCAATTA TTCGTGGTTC AGCATTAGAA GCATTAAATG GTAAACCTGA 2 50 

ATGAGAAGAA AAAATTAAAG AATTAATGAA GGCAGTGGAT GACACTATTC 3 00 

CTGATCCAGT TCGTGATACT GAAAAGCCAT TCTTGTTACC AATTGAAGAC 3 50 

50 GTAATGACAA TTACAGGTCG TGGTACTGTT GTTACAGGTC GTGTAGAACG 400 

TGGTACTCTA AAATTAAATG ATGAAGTTGA AATTGTTGGT TTAGGTGAAA 450 

CATTTAAATC TGTTGTAACA GGTATTGAAA TGTTCCGTAA AGAATTAGAT 500 

GAAGCTCGTG CTGGTGACAA TGCTGGTATT TTACTTCGTG GTGTTGACCG 550 

TGGTCAAGTA CAACGTGGTC AAGTTCTTGC TAAACCAGGT TCTATTACTC 600 

55 CTCATACTAA ATTTAAAGCT GAAATTTATG CTTTGAAAAA AGAAGAAGGT 650 

GGTCGTCATA CTGCTTTCTT AAACGGTTAT CGTCCTCAAT TCTATTTCAG 700 

AACAACTGAT GTTACAGGTT CTATTAAATT AAAAGATGGA ACTGAAATGG 750 

TTATGCCTGG TGACAATACT GAAATCACTG TAGAATTAAT TTCACCAATT 800 

GCTTGTAAAA AGGAAGTAAG TTT 823 



60 
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2) INFORMATION FOR SEQ ID NO: 72 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma salivarium 
15 (B) STRAIN: ATCC 23064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72 9 

GGAGCAATCT TAGTTGTTGC TGCAACTGAT GGTGCGATGC CTCAAACTCG 50 

2 0 TGAACACGTT TTACTTGCAA AACAAGTTGG TGTTCCTAAA ATCGTTGTTT 100 

TCTTAAACAA AATCGATATG TTCAAGCCAG AAGAAAGAGC CGAAATGGTT 150 

GAAATGGTTG AAATGGACAT TCGTGACTTA TTAAACAAAT ACGACTTTGA 2 00 

TGGTGACAAT ACTCCTATTA TTGCTGGTTC AGCATTAAAA GCATTACAAG 250 

GTGATCCAGA ATATGAAAAG AATATTCTAG AACTTATGGA TGCAGTAGAC 3 00 

2 5 ACATACATTG ATGAACCTAC TCGTGATAAA GATAAACCAT TCTTAATGGC 3 50 

TGTAGAAGAC GTATTTACAA TTACTGGTCG TGGAACTGTT GCTACTGGTA 4 00 

GAGTAGAACG TGGTACATTA CATCTAAATG ATGAAGTTGA AATCGTTGGT 450 

CTACGTCCAA CAATTAAAAC TGTTGTTACT GGAATTGAAA TGTTCCGTAA 5 00 

AAATTTAAAA GAAGCTCAAG CTGGAGATAA TGCAGGACTT TTACTACGTG 550 

3 0 GAATTGATAG AGATCAAGTA GAACGTGGAC AAGTTTTGGC CAAACCAAAA 6 00 

AGTATTATTC CTCACACAGA ATTTGAAGCT GCTGTGTATG TTCTAAAAGC 650 

TGAAGAAGGT GGACGTCACA CTCCATTCTT TGAACACTAT AAACCACAAT 700 

TTTACTTTCG TACAACCGAC GTTACTGGTG GAATTAAATT CAAACCTGGA 7 50 

CGTGAAATGG TTATGCCTGG CGAAAATGTT GAATTTACAG TTACTTTAAT 800 

3 5 TGCTCCTATT GCAGTTGAAG AAGGAA 82 6 



40 



45 



2) INFORMATION FOR SEQ ID NO: 730 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 0 



55 TGGTATGTTC CGCAGCYGAY 
CTGYTGGCTC GCCAAGTAGG 
ATGCGACATG GTTGACGATG 
TCCGYGACCT GYTGTCMAGC 
GTACAAGGTT CTGCACTGAA 

60 GAAAATCTTC GAAYTGGCTG 



GGYCCTATGC 
YGTACCTTAC 
CCGAGCTGTT 
TACGACTTCC 
AGCTTTGGAA 
CTGCTTTGGA 
41 



CTCAAACTCG 
ATCATCGTRT 
GGAAC TGGTT 
CMGGCGACGA 
GGYGAYGCTG 
CAGCTACATC 
6 



CGAACACATC 50 

TCATGAACAA 10 0 

GAAATGGAAA 15 0 

CTGCCCAATC 2 00 

GTTACGAAGA 2 50 

CCAACTCCTG 3 00 
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AGCGTGCTGT GGACAAACCK 
ATCTCTGGYC GYGGTACMGT 
CCACGTTGGT GACGAGATYG 
CCACTTGTAC CGGCGTTGAA 
5 GCWGGTGACA ACGTAGGCGT 
AGAGCGTGGT CAAGTATTGG 
AATTCAAAGC AGAAGTATAC 
ACTCCATTCT TCGCYAACTA 
CGTAACYGGT GCAGTTACTT 
10 GTGAGAACGT AACCATTACY 
AAGGTCTGCG 



TTCYTGYTGC CTATCGAAGA 
AGTAACYGGY CGTGTAGAGC 
AAATCGTAGG TCTGAAAGAA 
ATGTTCCGCA AACTGCTGGA 
ATTGYTGCGT GGTACCAARC 
CYAAACCAGG YACCATYACT 
GTACTGAGCA AAGAAGAGGG 
CCGYCCMCAA TTCTACTTCC 
TGGAAGAAGG TGTAGAAATG 
GTAGAACTGA TTGCGCCTAT 



PCT/CAOO/01150 



CGTRTTbTCT 


350 


GCGGTRTCAT 


400 


ACCCAAAAAA 


450 


CGAAGGTCAA 


500 


GTGAAGACGT 


550 


CCSCACACCA 


600 


TGGTCGYCAC 


650 


GTACYACTGA 


700 


GTAATGCCAG 


750 


CGCTAGGAAG 


800 




810 



15 2) INFORMATION FOR SEQ ID NO: 731 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella, choleraesuis subsp . choleraesuis 
serotype Enteritidis 

(B) STRAIN: ATCC 13 076 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731 

CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC GTGAGCACAT 50 

CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG TTCCTGAACA 100 

AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT TGAAATGGAA 150 

3 5 GTTCGYGAAC TGCTGTCTCA GTACGACTTC CCGGGCGACG ACACTCCGAT 2 00 

CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA GAGTGGGAAG 250 

CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT CCCGGAACCA 300 

GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG ACGTATTCTC 350 

CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA CGCGGTATCA 400 

40 TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA GACTCAGAAG 450 

TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG ACGAAGGCCG 500 

TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA CGTGAAGAAA 550 

TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA GCCGCACACC 600 

AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG GCGGCCGTCA 650 

45 CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC CGTACTACTG 700 

ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT GGTAATGCCG 750 

GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA TCGCAATGGA 800 

CGACGGTCTG CGT 813 



50 



2) INFORMATION FOR SEQ ID NO: 732 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

417 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuAs 
serotype Gallinarum 
5 (B) STRAIN: ATCC 9184 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 732 

CTGGTTGTTG CTGCGACTGA CGGCCCGATG CCGCAGACCC GTGAGCACAT 50 

10 CCTGCTGGGT CGTCAGGTAG GCGTTCCGTA CATCATCGTG TTCCTGAACA 100 

AATGCGACAT GGTTGATGAC GAAGAGCTGC TGGAACTGGT TGAAATGGAA 150 

GTTCGYGAAC TGCTGTCTCA GTACGACTTC CCGGGCGACG ACACTCCGAT 2 00 

CGTTCGTGGT TCTGCTCTGA AAGCGCTGGA AGGCGACGCA GAGTGGGAAG 2 50 

CGAAAATCAT CGAACTGGCT GGCTTCCTGG ATTCTTACAT CCCGGAACCA 300 

15 GAGCGTGCGA TTGACAAGCC GTTCCTGCTG CCGATCGAAG ACGTATTCTC 350 

CATCTCCGGT CGTGGTACCG TTGTTACCGG TCGTGTAGAA CGCGGTATCA 4 00 

TCAAAGTGGG CGAAGAAGTT GAAATCGTTG GTATCAAAGA GACTCAGAAG 450 

TCTACCTGTA CTGGCGTTGA AATGTTCCGC AAACTGCTGG ACGAAGGCCG 50 0 

TGCCGGTGAG AACGTAGGTG TTCTGCTGCG TGGTATCAAA CGTGAAGAAA 55 0 

20 TCGAACGTGG TCAGGTACTG GCTAAGCCGG GCACCATCAA GCCGCACACC 600 

AAGTTCGAAT CTGAAGTGTA CATTCTGTCC AAAGATGAAG GCGGCCGTCA 650 

CACTCCGTTC TTCAAAGGCT ACCGTCCGCA GTTCTACTTC CGTACTACTG 70 0 

ACGTGACTGG CACCATCGAA CTGCCGGAAG GCGTAGAGAT GGTAATGCCG 750 

GGCGACAACA TCAAAATGGT TGTTACCCTG ATCCACCCGA TCGCAATGGA 8 00 

25 CGACGGTCTG CG 812 



2) INFORMATION FOR SEQ ID NO: 733 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype Paratyphi B 
(B) STRAIN: ATCC 8759 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 733 



TGGTTGTTGC TGCGACTGAC GGYCCGATGC CGCAGACCCG TGAGCACATC 50 

CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA 100 

ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG 150 

TTCGYGAACT GCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC 2 00 

50 GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC 2 50 

GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT CCGGAACCAG 3 00 

AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC 3 50 

ATCTCTGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT 4 00 

CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACTCAGAAGT 4 50 

55 CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT 50 0 

GCYGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT 550 

CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG CCGCACACCA 60 0 

AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT 650 

ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA 700 

60 CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG 750 

418 
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GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCAATGGAC 80tf 
GACGGTCTGC GTTT 814 



2) INFORMATION FOR SEQ ID NO: 734 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 8 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



20 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella, choleraesuis subsp. choleraesuis 
serotype Virchow 

(B) STRAIN: ATCC 51955 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734 



GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACCCG 50 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 100 

2 5 TCCTGAACAA ATGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 150 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CGGGCGACGA 200 

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCGCTGGAA GGCGACGCAG 250 

AGTGGGAAGC GAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTACATT 3 00 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 3 50 

3 0 CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAAC 4 00 

GCGGTATCAT CAAAGTGGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAG 450 

ACTCAGAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AAC TGCTGGA 500 

CGAAGGCCGT GCCGGTGAGA ACGTAGGTGT TCTGCTGCGT GGTATCAAAC 550 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CACCATCAAG 60 0 

3 5 CCGCACACCA AGTTCGAATC TGAAGTGTAC ATTCTGTCCA AAGATGAAGG 650 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 700 

GTACTAC TGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 750 

GTAATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 800 

CGCAATGGAC GACGGTCTGC GTTTCGCA 82 8 

40 

2) INFORMATION FOR SEQ ID NO: 735 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia grlmesii 
55 (B) STRAIN: ATCC 14460 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 5 

GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCAATGC CACAGACCCG 50 
60 TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCTTTC ATCATCGTAT 100 

419 
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TCATGAACAA ATGCGACATG GTTGATGATG 
GAAATGGAAG TTCGTGAACT TCTGTCTGCT 
CCTGCCAGTT GTTCGTGGTT CAGCGCTGAA 
AGTGGGAAGC TAAAATCATC GAACTGGCTG 
5 CCAGAACCAG AGCGTGCTAT CGACAAGCCG 
CGTATTCTCC ATCTCCGGYC GTGGTACYGT 
GCGGTATCGT TAAAGTTGGC GAAGAAGTTG 
ACCGTTAAGT CTACCTGTAC TGGCGTTGAA 
CGAAGGCCGT GCTGGTGAGA ACGTAGGTGT 

10 GTGAAGACAT CGAACGTGGT CAGGTACTGG 
CCACACACCA AATTCGACTC AGAAGTTTAC 
TGGTCGTCAC ACTCCATTCT TCAAAGGCTA 
GTACAACTGA CGTGACCGGT ACCATCGAAC 
GTAATGCCAG GCGATAACGT GAACATGGTT 

15 CGCGATGGAC GACGGTCTGC GTTTC 



PCT/CA00/01150 

AAGAGCTGCT GGAACVKSGTA 150, 

TATGACTTCC CTGGTGATGA 2 00 

AGCACTGGAA GGCGAAGCTG 2 50 

GCTACCTGGA TTCTTACATC 300 

TTCCTGCTGC CAATCGAAGA 350 

AGTTACCGGT CGTGTAGAGC 4 00 

AAATCGTTGG TATCAAAGAC 4 50 

ATGTTCCGCA AACTGCTGGA 500 

TCTGCTGCGT GGTATCAAGC 550 

CTAAACCAGG TTCAATCAAG 60 0 

ATCCTGAGCA AAGAAGAAGG 650 

CCGTCCACAG TTCTACTTCC 70 0 

TGCCAGAAGG CGTAGAGATG 750 

GTAACCCTGA TTCACCCAAT 80 0 

825 



20 



25 



30 



2) INFORMATION FOR SEQ ID NO: 736 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 73 6 



35 TATTAGTTTG 
ATACTATTAT 
CAAATGTGAC 
AAGTAAGAGA 
ATAGTAAGAG 

40 GGGAGATAAG 
CTCCAGAGAG 
TTCTCAATCA 
AGTACTAAAA 
CAAGAAAAGT 

4 5 CAAGCACAAG 
AAACGAGATA 
CACACACAAA 
GGAAGACATA 
AACAACAGAC 

50 TAATGCCTGG 



TTCAGCAACA 
CAAGACAAGT 
ATGGTAGATG 
TTTATTAACA 
GTTCAGCATT 
ATAGTAGAAT 
AGATACAGAT 
CAGGAAGAGG 
GTACAAGACG 
AGTAGTAACA 
CAGGGGATAA 
GAAAGAGGAC 
GTTTACAGCA 
CACCATTCTT 
GTAACAGGAG 
AGATAACGTA 



GATGGACCAA 
TGGAGTACCA 
ATGAAGAGTT 
GAATATGATT 
AATGGCATTA 
TATTCGAGCA 
AAACCATTCT 
AACAGTTGCA 
AAGTAGAKTT 
GGAGTAGAGA 
TATAGGAGCA 
AAGTACTAGC 
GAAGTATATG 
TGATGGATAT 
CTTGTAAGTT 
ACAATGGAAG 



TGCCACAAAC 
TATATAGTAG 
ATTAGAGTTA 
TCCCAGGAGA 
GAAGATCCAA 
AATAGATGAG 
TAATGCCAGT 
ACAGGAAGAG 
AGTAGGATTA 
TGTTCAGAAA 
TTATTAAGAG 
AAAGACTGGA 
TACTTAAAAA 
AGACCACAAT 
ACCAGAAGGA 
TAGACTTAAT 



AAGAGAGCAT 50 

TATTCTTAAA 100 

GTAGAGATGG 150 

TGACACTCCA 2 00 

AGAGYGAGTG 250 

TATATACCAG 300 

AGAGGACGTA 3 50 

TGGAAAGAGG 400 

ACAGAAGCAC 450 

ATTATTAGAC 500 

GAGTACAAAG 550 

TCAGTAAAGG 600 

AGAAGARGGT 650 

TCTATTTCAG 70 0 

ATAGAGATGG 750 

AAACTCAA 798 



55 



2) INFORMATION FOR SEQ ID NO: 737 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 411 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

420 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia pseudomallei 

(B) STRAIN: 1026B 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 737 



10 GTCAACATGA 
CTCCGTGTTC 
AC CACGAAAT 
GGCCAGATGA 
CCTCACGATG 

15 TCGTCGACAA 
CTGCTCGGCC 
AGAAATGGGC 
TCACGTCGGT 



TGGAGCTCAT 
GCGGGCGTGG 
GAAGGACTCG 
ACGAGCCGCC 
GCCGAGCACT 
CATCTACCGT 
GTATGCCGTC 
AAGCTGCAAG 
T 



CAACAACATC 
GCGAGCGTAC 
AACGTTCTCG 
GGGCAACCGT 
TCCGTGACGA 
TTCACGCTGG 
GGCAGTGGGC 
AGCGCATCAC 



GCGAAGGAGC 
CCGTGAAGGG 
ACAAGGTCGC 
CTGCGCGTGG 
AGGCCTCGAC 
CCGGTACCGA 
TATCAGCCGA 
GTCGACGAAG 



ACGGCGGTTA 50 

AACGACTTCT 100 

GCTGGTGTAC 150 

CGCTGACGGG 2 00 

GTGCTGTTCT 2 50 

AGTGTCGGCG 3 00 

CGCTGGCTGA 3 50 

AAGGGCTCGA 4 00 

411 



20 



25 



3 0 



35 



2) INFORMATION FOR SEQ ID NO : 73 8 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 94 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium bifexmentans 

(B) STRAIN: ATCC 638 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 8 



TACAAGAGCT 
TTCGCAGGTG 

40 GATGAGCGAT 
TGAATGAGCC 
ATGGCTGAAT 
AGATAATATA 
TTGGACGTAC 

4 5 ATGGGTAGAT 



TATTAACAAT 
TTGGAGAGAG 
ACAGGAGTTA 
ACCTGGAGCA 
ACTTCAGAGA 
TTCCGTTTCA 
TCCATCAGCA 
TACAAGAGAG 



ATAGCTACTC 
AACAAGAGAA 
TAAATAAAAC 
AGAATGAGAG 
TCAACAAGGG 
CTCAAGCAGG 
GTTGGATACC 
AATAACATCT 



AACACGGTGG 
GGTAACGACT 
AGCTCTAGTA 
TTGCTTTAAC 
CAAGACGTTT 
ATCTGAGGTT 
AACCAACATT 
ACAAATAAAG 



TATATCAGTA 50 

TATTCCATGA 100 

TTCGGACAAA 150 

TGGTCTTACA 2 00 

TATTATTCGT 250 

TCTGCACTTC 3 00 

AGCAAC AGAG 350 

GGTC 3 94 



50 



2) INFORMATION FOR SEQ ID NO: 73 9 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 94 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

421 
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(A) ORGANISM: Clostridium beijerincki (deposited as 
Clostridium butyricum) 

(B) STRAIN: ATCC 8260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73 9 



2) INFORMATION FOR SEQ ID NO: 740 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 393 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740 

TTATAAACAA TATTGCTAAG CAACATGGTG GTATTTCTGT ATTTTCAGGA 50 

GTAGGAGAAA GAACAAGAGA AGGTAACGAC CTTTATGGCG AAATGAGTGA 100 

GTCTGGAGTT ATAAATAAAA CAGCTCTAGT ATTTGGTCAA ATGAATGAAC 150 

CACCTGGAGC GAGAATGAGA GTTGCTTTAA CTGGACTTAC AATGGCAGAA 2 00 

CATTTTAGAG ATGAGCAAGG ACAAGACGTT TTACTTTTCG TTGATAATAT 2 50 

ATTCCGTTTC ACACAAGCTG GTTCAGAAGT TTCAGCACTT CTAGGACGTA 3 00 

TGCCATCAGC TGTTGGTTAT CAGCCAACAT TAGCTACTGA AATGGGTGCA 3 50 

CTTCAAGAGA GAATAACATC AACTAAGAAA GGTTCAATAA CAT 3 93 



2) INFORMATION FOR SEQ ID NO: 741 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium ramosum 

(B) STRAIN: ATCC 25582 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741 



TTAATAAACA ACATAGCTAA 
AGTTGGTGAA AGATCAAGAG 
AGTCAGGAGT TATTGATAAG 
CCACCGGGTG CCAGAATGAG 
GTATTTTAGA GATAAAGGTC 
TCAGATATAC TCAAGCAGGT 
CCTTCAGCGG TTGGATATCA 
TCAGGAAAGA ATTACATCAA 



ACAACATGGT 
AAGGTAATGA 
ACAGCATTAG 
AGTTGCATTA 
AAGATGTGTT 
TCAGAGGTTT 
GCCAACACTT 
CAGTTAATGG 



GGTTTATCAG TATTTACTGG 
CTTATATCAT GAAATGAGAG 
TATTTGGACA AATGAATGAG 
ACAGGTCTTA CTATGGCAGA 
ACTATTCATA GATAACATAT 
CAGCATTACT TGGAAGAACA 
GCAACTGAAA TGGGTGCACT 
TTCTATTACG TCAG 



50 
100 
150 
200 
250 
300 
350 
394 
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TTGATTCAAG AATTCATTAA 
AGTTTTTGCT GGAGTTGGTG 
ATGAAATGAA GGAAAGTGGT 
CAGATGAATG AACCCCCAGG 
TACTATGGCA GAAGAATTCC 
TCATCGATAA TATTTTCCGT 
TTACTTGGAC GGGTACCATC 
CGAAATGGGT GCTTTACAAG 



TAACATTGCT 
AACGTAGCCG 
GTTTTATCTA 
AGCTCGTTTA 
GTGATGAACA 
TTTACTCAAG 
ACAAGCTGGG 
AACGGATTAC 



ACAGAACATG GTGGTTf ATC 
TGAAGGTAAT GATTTATATT 
AAACAACACT AGTATTTGGA 
AGAGTTGCTT TAACGGGTCT 
AGGTCAGGAT GTCTTATTAT 
CTGGATCTGA AGTATCTGCC 
TATCAGCCAA CTTTAGCAAC 
ATCAACTAAA AAAGGATC 



50 
100 
150 
200 
250 
300 
350 
398 



2) INFORMATION FOR SEQ ID NO: 742 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 80 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12964 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 742 

TAGCTAAGGA ACACGGTGGA CTTTCAGTAT TCACAGGTGT TGGAGAAAGA 50 

TCAAGAGAAG GTAATGATTT ATATTACGAA ATGAAAGAAT CAGGAGTTAT 100 

AGACAAGACA GCTCTAGTGT TTGGACAAAT GAATGAATCT CCAGGAGCTA 150 

GAATGAGAGT ATCTTTAACA GGATTAACTA TGGCTGAATA TTTCAGAGAT 2 00 

CAAGGTCAAG ATGTGCTTTT ATTCATAGAT AACATATTTA GATTTACTCA 2 50 

AGCTGGATCA GAAGTATCGG CTTTACTTGG AAGAATACCA TCAGCAGTTG 3 00 

GTTATCAACC AACACTAGCA ACTGAAATGG GTGCACTTCA AGAAAGAATT 3 50 

ACTTCAACTA AAAATGGATC AATAACTTCA 3 80 



2 ) INFORMATION FOR SEQ ID NO: 743 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 89 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tertium 

(B) STRAIN: ATCC 14573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 3 

TTAATAAATA ATATAGCAAA AGAGCATGGT GGTCTTTCTG TATTTACAGG 50 

AGTTGGAGAA AGGTCAAGAG AAGGTAACGA CTTATATTAT GAAATGAAAG 100 

AGTCAGGGGT TATAGATAAG ACAGCTTTAG TATTTGGACA AATGAATGAA 150 

TCACCAGGAG CAAGAATGAG AGTTTCATTA ACTGGATTAA CTATGGCTGA 2 00 

ATATTTTAGA GATCAAGGTC AAGACGTTCT TTTATTTATA GATAATATAT 2 50 

TTAGATTTAC TCAAGCGGGA TCAGAAGTTT CTGCGTTATT AGGAAGAATT 3 00 



423 
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CCTTCAGCAG TTGGATATCA ACCAACTCTT GCAACTGAAA TGGGAGCACT 35t> 
TCAAGAAAGA ATAACATCAA CAAAGAATGG ATCAATCAC 389 



2) INFORMATION FOR SEQ ID NO: 744 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Comamonas acidovorans 

(B) STRAIN: ATCC 15668 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744 

TTCCCCCGCA CGCATGCCCA AGGTGTTCGA TGCCCTGAAG CTCGACGGCT 50 

CGGCCCTGAC GCTGGAAGTG CAGCAACTGC TGGGTGACGG CGTTGTGCGT 100 

ACCATCGCCC TGGGTTCGTC CGACGGTCTG CGTCGCGGCC TGATGGTGTC 150 

CAACACCGGC AACCCCATCA CCGTGCCCGT GGGCAAGGCG ACGCTGGGTC 2 00 

GCATCATGGA CGTGCTGGGC AATCCCATCG ACGAACGTGG TCCCGTGGAT 2 50 

CAGGCGCTGA CGGCTCCCAT CCACCGCAAG GCACCGGCTT ATGACGAGCT 3 00 

GTCGCCTTCG CAGGAACTGC TGGAAACCGG CATCAAGGTG ATCGACCTGA 3 50 

TCTCGCCCTT CGCCAAGGGC GGCAAGGTGG GTCTGTTCGG TGGCGCCGGT 4 00 

GTGGGCAAGA CCGTGAACAT GATGGAACTC ATCAACAACA TCGCCAAGGG 450 

CCACGGTGGT CTGTCGGTGT TCGCCGGTGT GGGTGAACGT ACCCGCGAAG 500 

GCAATGACTT CTATCACGAA ATGTCGGACG CCGGCGTGGT CAACCAGGAG 550 

TCGCTGAACG ACTCCAAGGT GGCCATGGTC TACGGCCAGA TGAACGAACC 600 

CCCGGGCAAC CGTCTGCGCG TGGCGCTGAC CGGCCTGACC ATGGCCGAAG 650 

CCTTCCGTGA CGAAGGCAAG GACGTGCTGT TCTTCGTGGA CAACATCTAC 70 0 

CGCTACACGC TGGCCGGTAC CGAAGTGTCC GCTCTGCTGG GTCGCATGCC 7 50 

TTCCGCCGTG GGCTACCAGC CCACGCTGGC CGAGGAAATG GGCCGCCTGC 800 

AAGAGCGCAT CACCTCGACC AAGGTCGGTT CGATCACTTC CAC 843 



2) INFORMATION FOR SEQ ID NO: 745 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleromatis 

(B) STRAIN: ATCC 13884 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 745 

GCCGTACCAC GCGTGTACGA AGCCCTTGAG GTACAGAATG GTAATGAAGT 50 
TCTGGTGCTG GAAGTTCAGC AGCAGCTGGG CGGCGGTATC GTACGTACCA 100 
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WO 01/23604 

TCGCCATGGG TTCTTCTGAT GGTCTGCGCC 
CTCGAGCACC CGATCGAAGT CCCGGTAGGT 
CATGAACGTA CTGGGTCAAC CGGTTGACAT 
AAGAGCGTTG GGCTATCCAC CGCGCGGCAC 
AGCTCTCAGG AACTGCTGGA AACCGGCATC 
TCCGTTCGCC AAGGGCGGTA AAGTTGGTCT 
GTAAAACTGT AAACATGATG GAGCTGATCC 
TCCGGTTACT CTGTGTTTGC GGGCGTAGGT 
CGACTTCTAC CACGAAATGA CCGACTCCAA 
TGGTGTACGG CCAGATGAAC GAGCCGCCGG 
CTGACCGGCC TGACCATGGC TGAGAAATTC 
ACTGCTGTTC GTCGATAACA TCTATCGTTA 
TATCCGCGCT GCTGGGTCGT ATGCCTTCAG 
CTGGCGGAAG AGATGGGCGT TCTGCAGGAA 
CGGTTCTATC ACCTCCGTA 



PCT/CA00/01150 

GCGGTCTGGA TGT^AAAGAC 450 

AAAGCAACGC TGGGTCGTAT 200 

GAAAGGCGAC ATCGGCGAAG 250 

CGTCCTATGA AGAGCTGTCC 300 

AAAGTTATCG ACCTGATGTG 3 50 

GTTCGGCGGT GCGGGTGTAG 4 00 

GTAACATCGC GATCGAGCAC 450 

GAGCGTACTC GTGAGGGTAA 500 

CGTTATCGAT AAAGTATCCC 550 

GAAACCGTCT GCGCGTTGCG 60 0 

CGTGACGAAG GTCGTGACGT 650 

CACCCTGGCC GGTACTGAAG 700 

CGGTAGGTTA TCAGCCGACC 750 

CGTATCACCT CCACCAAAAC 80 0 

819 



2) INFORMATION FOR SEQ ID NO: 746 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria, canis 

(B) STRAIN: ATCC 14687 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 6 



GCGATTCTAT TCCGCGCGTA TATGATGCTC 
CTGACGCTTG AAGTACAACA ACAGTTGGGT 
TGCGATGGGT AGTTCCGACG GCCTCAAACG 
CCGGTGCTCC AATTACAGTG CCTGTGGGCA 
ATGGACGTAT TAGGTAATCC GGTTGATGAA 
GCAAACCCGA GCAATCCACC AACCTGCTCC 
GCGCCACAGA GATTTTGGAA ACAGGTATTA 
CCGTTTGCCA AAGGCGGTAA AGTAGGTTTG 
CAAAACCGTA AATATGATGG AGTTGATTAA 
GCGGTTTGTC TGTATTTGCC GGTGTGGGTG 
GACTTTTATC ATGAGATGAA AGATTCCAAT 
GGTTTACGGT CAGATGAACG AGCCTCCCGG 
TAACTGGCTT GTCTATGGCC GAATTCTTCC 
GGTAAAGGCC GTGATGTATT GTTCTTTGTA 
CCTAGCCGGT ACAGAAGTAT CTGCATTGCT 
TAGGTTATCA GCCGACGTTG GCAGAGGAAA 
ATTACYTCAM CCCAAACAGG CTCT 



TTAAACTAGT GGATAGAGAA 50 

GATGGTGTCG TTCGTACTAT 100 

AGGTTTGGCG GTAGTTAACA 150 

AAGCAACATT AGGCCGTATT 2 00 

GCTGGTCCGA TTGGCTCCGA 250 

TAAGTTCGAC GAGCTTTCTA 3 00 

AAGTAATTGA TTTGCTTTGC 3 50 

TTTGGTGGTG CGGGAGTGGG 400 

CAACATCGCG AAAGCACACA 450 

AACGGACGCG TGAAGGTAAT 500 

GTATTAGATA AAGTAGCCAT 550 
TAACCGTTTG CGCGTTGCGC - 600 

GTGACGAGAA AGATGAAAAC 650 

GACAATATTT ACCGCTATAC 700 

TGGCCGTATG CCTTCGGCAG 750 

TGGGCCGCTT GCAAGAGCGT 800 

824 



2) INFORMATION FOR SEQ ID NO: 74 7 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 



10 



15 



20 



25 



<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria, cinerea 

(B) STRAIN: ATCC 14685 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 7 



CGCGACGCTA 
TCTGACTCTG 
TTGCAATGGG 
ACTGGTGCGC 
TGTCGACGTA 
AC AAAAGC CG 
TCAGCTACCG 
TCCGTTTGCT 
GCAAAACCGT 
AGCGGTCTGT 
CGACTTCTAC 
TGGTGTATGG 
TTGACCGGTT 
CGGTAAAGGC 
CTTTGGCCGG 
GTGGGTTACC 
TATTACCTCT 



TCCCGCATGT 
GAGGTTCAAC 
TAGTTCAGAC 
CAATCACTGT 
TTGGGTACGC 
TGCCATTCAC 
AATTGTTGGA 
AAAGGCGGTA 
GAACATGATG 
CCGTGTTCGC 
CACGAGATGA 
CCAAATGAAC 
TGACTATGGC 
CGCGACGTAT 
TACTGAAGTA 
AACCGACATT 
ACCCAAACCG 



TTACGATGCC 
AGCTTCTGGG 
GGCCTTAAAC 
GCCGGTAGGT 
CTGTTGATGA 
CAAACTGCTC 
AACCGGTATT 
AAGTAGGTCT 
GAATTGATCA 
AGGTGTGGGT 
AAGATTCCAA 
GAACCTCCGG 
CGAATACTTC 
TGTTCTTCGT 
TCTGCACTGT 
GGCTGAAGAA 
GTTCCATTAC 



CTGAAATTGG 
CGACGGCGTT 
GCGGTATGTC 
AAAGGTACAT 
AGCAGGTCCG 
CGAAATTCGA 
AAAGTGATCG 
GTTCGGTGGT 
ACAACATCGC 
GAGCGTACCC 
CGTATTGGAT 
GCAACCGTCT 
CGTGACGAAA 
TGACAACATC 
TGGGCCGTAT 
ATGGGTCGTT 
T 



ACGAGAACGG 50 

GTCCGTACTA 100 

TGTAAGCAAT 150 

TGGGTCGTAT 2 00 

ATCGATACCG 2 50 

CGAGTTGTCT 3 00 

ACTTGCTGTG 3 50 

GCCGGTGTGG 4 00 

CAAAGCGCAC 450 

GTGAAGGTAA 50 0 

AAAGTAGCCA 550 

GCGCGTTGCT 6 00 

AAGACGAAAA 650 

TACCGTTACA 700 

GCCTTCTGCA 750 

TGCAAGAGCG 800 

831 
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2) INFORMATION FOR SEQ ID NO: 748 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 862 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria cuniculi 

(B) STRAIN: ATCC 14688 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 8 

CCGTGGCCAA GTACCACAAA TTTATGACGC ACTGAGTGTT GATGGCACCG 50 

AAACAACCTT GGAAGTTCAA CAGCAGTTGG GTGATGGCGT GGTGCGTACC 100 

ATTGCGATGG GTTCAACCGA AGGCTTGAAG CGTGGTTTGA CTGTATCTAA 150 

50 CTCTGGTGCA CCGATTTCTG TGCCAGTGGG TCAAGCGACT TTGGGTCGTA 2 00 

TTATGGATGT GTTGGGTCGT CCAATCGACG AGGCAGGTCC TGTAAATGCT 2 50 

CAAGAAAAAT GGTCAATTCA CCGTGAAGCA CCAAGCTATG ATGAGCAATC 3 00 

AAACTCAACT GAGCTGCTAG AAACAGGCAT CAAAGTAATT GATTTGCTTT 3 50 

GCCCATTTGC TAAAGGTGGT AAAGTTGGCT TGTTCGGTGG TGCAGGTGTG 4 00 

55 GGTAAAACCG TGAATATGAT GGAGCTGATT AATAATATCG CTCTGAAGCA 4 50 

TTCAGGTCTT TCTGTTTTTG CAGGTGTTGG TGAGCGTACT CGTGAGGGTA 50 0 

ACGATTTTTA TCACGAAATG CAAGAAGCAG GCGTTGTTAA TACCGAAGAC 550 

TTCACCAAGT CAAAAGTAGC GATGGTTTAT GGTCAGATGA ATGAGCCACC 600 

AGGAAACCGT TTGCGTGTTG CATTGACAGG CTTGACGATG GCAGAATATT 650 

60 TCCGTGACGA AAAAGATGAA GCAACAGGCA AAGGGCGTGA TGTTCTATTG 700 
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TTCGTTGATA ACATCTATCG 
ACTTCTAGGT CGTATGCCAT 
AAGAAATGGG TGCGTTGCAA 
ATCACTTCGG GG 



TTACACACTG GCTGGTACGG 
CAGCAGTAGG TTATCAACCG 
GAGCGTATTA CCTCAACGCA 



PCT/CA00/01150 

AAGTGTCAGC 750' 
ACTCTGGCTG 800 
ATCGGGTTCG 850 

862 



2) INFORMATION FOR SEQ ID NO: 749 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria elongata subsp. elongata 

(B) STRAIN: ATCC 252 95 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74 9 

GGAACTCCCA CGTGACGCTA TCCCGCATGT TTTTGATGCA TTAAAATTAG 50 

TTGAAAATGA CCTAACCTTA GAAGTTCAAC AACTTTTGGG GGATGGTGTA 100 

GTGCGTACCA TTGCGATGGG TAGTTCAGAT GGATTAAAGC GTGGTATGGC 150 

TGTGAATAAT ACCGGAGCTC CGATTACTGT TCCTGTTGGC CGTGAAACTT 2 00 

TGGGTCGTAT CATGGATGTA TTGGGTAATC CGGTTGATGA GGCAGGTCCG 2 50 

GTAAATGCAT CCAATACACG TGCGATCCAT CAAGAGGCTC CTAAGTTTGA 3 00 

TGAGCTTTCT TCAACAACGG AATTATTAGA AACTGGCATT AAGGTTATCG 350 

ACTTGTTATG TCCGTTTGCC AAAGGTGGTA AAGTAGGTCT GTTTGGTGGT 4 00 

GCGGGTGTAG GTAAAACCGT AAATATGATG GAGTTAATTA ACAACATTGC 450 

CAAGGCACAT AGCGGTTTGT CTGTGTTTGC AGGCGTGGGT GAACGTACTC 500 

GTGAAGGTAA TGACTTCTAT CACGAGATGA AAGATTCCAA CGTATTGGAC 550 

AAAGTGGCAA TGGTTTACGG TCAGATGAAC GAACCTCCAG GCAACCGTCT 60 0 

GCGCGTTGCT TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA 650 

AAGACGAAAA CGGTAAAGGT CGCGACGTAT TGTTCTTCGT GGACAACATT 700 

TACCGTTACA CTTTGGCCGG TACGGAAGTA TCCGCATTGC TGGGTCGTAT 750 

GCCTTCAGCA GTAGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT 800 

TGCAAGAGCG TATTACCTCT AC C C AG AC AG GCTCTATTAC TTCC 844 



2) INFORMATION FOR SEQ ID NO: 7 50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria flavescens 

(B) STRAIN: ATCC 13120 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 750 

CGCGACGCTA TTCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 
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TCTGACTCTG GAAGTTCAAC AGCTTCTGGG TGACGGCGTT GTCCGTACTA 100* 

TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GCGGCATGTC TGTAAGCAAT 150 

ACCGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGCCGTAT 200 

TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 2 50 

5 ACAAGAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA TGAGTTGTCT 300 

TCAACTACCG AATTGTTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 350 

TCCGTTTGCT AAGGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 4 00 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 4 50 

AGCGGCCTGT CCGTGTTCGC AGGTGTGGGT GAACGTACCC GTGAAGGTAA 500 

10 CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG TCAAATGAAC GAACCTCCAG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGATGAAAA 650 

CGGTAAAGGT CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

CTCTGGCCGG TAC CGAAGT A TCCGCACTGT TGGGTCGTAT GCCTTCTGCA 750 

15 GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAGGAGCG 800 

TATTACTTCT ACGCAAACCG GTTCCATTAC TTCC 834 



20 2) INFORMATION FOR SEQ ID NO: 751 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear „ 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: ATCC 31426 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 751 



CGCGACATGA 
TCTGACTTTG 
TCGCTATGGG 
ACTGGTTCGC 
TGTCGATGTA 
ACAAGAGTCG 
TCCACAACCG 
TCCGTTTGCC 
GTAAAACCGT 
AGCGGCTTGT 
CGACTTCTAC 
TGGTGTATGG 
TTGACCGGTT 
CGGTAAAGGT 
CTCTGGCCGG 
GTGGGTTACC 
TATTACCTCT 



TTCCGCGCGT 
GAAGTCCAAC 
CAGCTCGGAC 
CCATTACTGT 
TTGGGAACTC 
TGCCATCCAC 
AATTGCTCGA 
AAAGGCGGTA 
GAACATGATG 
CCGTGTTCTC 
CACGAGATGA 
CCAAATGAAC 
TGACTATGGC 
CGTGACGTAT 
TACCGAAGTA 
AACCGACATT 
ACCCAAACCG 



TTACGACGCT 
AGCTTTTGGG 
GGTTTGAAAC 
GCCGGTAGGT 
CTGTTGACGA 
CAAGCCGCTC 
AACGGGCATT 
AAGTAGGTCT 
GAATTGATCA 
AGGCGTGGGT 
AAGATTCCAA 
GAACCTCCAG 
TGAATACTTC 
TGTTCTTCGT 
TCCGCACTGT 
GGCTGAAGAA 
GTTCCATTAC 



TTGAAATTAG 
TGATGGCGTA 
GCGGCATGAC 
AAAGGTACGT 
GGCAGGTCCA 
CTAAGTTTGA 
AAAGTGATTG 
GTTCGGCGGT 
ACAACATCGC 
GAGCGTACCC 
CGTATTGGAT 
GCAACCGTCT 
CGTGACGAAA 
TGACAACATC 
TGGGCCGTAT 
ATGGGTCGTT 
TTCC 



ACGAAAACGG 50 

GTCCGTACCA 100 

TGTGAGCAAT 150 

TGGGACGCAT 2 00 

ATTGATACCG 2 50 

CGAACTGTCT 3 00 

ACTTGCTGTG 3 50 

GCCGGTGTGG 4 00 

CAAAGCGCAC 4 50 

GCGAAGGTAA 500 

AAAGTAGCCA 550 

GCGCGTTGCT 600 

AAGACGAAAA 650 

TACCGTTACA 700 

GCCTTCTGCA 750 

TGCAAGAGCG 8 00 

834 
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2) INFORMATION FOR SEQ ID NO: 752 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 5 bases 

(B) TYPE: Nucleic acid 
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(C) STRAND EDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: ATCC 2762 8 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752 

ACATGATTCC GCGCGTTTAC GACGCTTTGA AATTAGACGA AAACGGTCTG 50 

ACTTTGGAAG TCCAACAGCT TTTGGGTGAT GGCGTAGTCC GTACCATCGC 100 

TATGGGCAGC TCGGACGGTT TGAAACGCGG CATGACTGTG AGCAATACTG 150 

15 GTTCGCCCAT TACTGTGCCG GTAGGTAAAG GTACGTTGGG ACGCATTGTC 2 00 

GATGTATTGG GAACGCCTGT TGACGAGGCA GGTCCAATTG ATACCGACAA 250 

GAGTCGTGCC ATCCACCAAG CCGCTCCTAA GTTTGACGAA CTGTCTTCCA 3 00 

CAACCGAATT GCTCGAAACG GGCATTAAAG TGATTGACTT GCTGTGTCCG 3 50 

TTTGCCAAAG GCGGTAAAGT AGGTCTGTTC GGCGGTGCCG GTGTGGGTAA 400 

2 0 AACCGTGAAC ATGATGGAAT TGATCAACAA CATCGCCAAA GCGCACAGCG 450 

GCTTGTCCGT GTTCTCAGGC GTAGGTGAGC GTACCCGCGA AGGTAACGAC 500 

TTCTACCACG AGATGAAAGA TTCCAACGTA TTGGATAAAG TAGCCATGGT 550 

GTATGGCCAA ATGAACGAAC CTCCAGGCAA CCGTCTGCGC GTTGCTTTGA 600 

CCGGTTTGAC TATGGCTGAA TACTTCCGTG ACGAAAAAGA CGAAAACGGT 650 

25 AAAGGTCGTG ACGTATTGTT CTTCGTTGAC AACATCTACC GTTACACTCT 70 0 

GGCCGGTACC GAAGTATCCG CACTGTTGGG CCGTATGCCT TCTGCAGTGG 750 

GTTACCAACC GACATTGGCT GAAGAAATGG GTCGTTTGCA AGAGCGTATT 800 

ACCTCTACCC AAACCGGTTC CATTA 82 5 
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2) INFORMATION FOR SEQ ID NO: 753 



(i) SEQUENCE CHARACTERISTICS: 

3 5 (A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria lactamica 

(B) STRAIN: ATCC 23970 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 753 



GATGCGATTC CGCATGTTTA CGATGCCCTG AAATTGGACG AGAACGGTCT 50 

GACCCTGGAA GTCCAACAGC TTTTGGGTGA CGGCGTTGTC CGTACTATTG 100 

50 CAATGGGTAG TTCAGACGGC CTGAAACGCG GCATGTCTGT CAGCAATACC 150 

GGTGCGCCAA TCACTGTGCC GGTAGGTAAA GGTACGTTGG GCCGTATTGT 2 00 

CGACGTATTG GGTACACCTG TTGACGAAGC AGGTCCGATC GATACCGACA 250 

AGAGCCGCGC CATCCACCAA ACCGCCCCGA AATTCGACGA GTTGTCTTCA 300 

ACTACCGAAT TGTTGGAAAC CGGCATTAAA GTGATCGATT TGCTGTGTCC 350 

55 GTTTGCTAAG GGCGGTAAAG TAGGTCTGTT CGGTGGTGCC GGTGTGGGCA 400 

AAACCGTGAA CATGATGGAA TTGATCAACA ACATCGCCAA AGCGCACAGC 450 

GGTCTGTCCG TGTTTGCAGG CGTGGGCGAG CGTACCCGCG AAGGTAACGA 500 

CTTCTACCAC GAGATGAAAG ATTCCAACGT ATTGGATAAA GTAGCCATGG 550 

TGTATGGTCA GATGAACGAA CCTCCGGGCA ACCGTCTGCG CGTTGCTTTG 600 

60 ACCGGTTTGA CGATGGC CGA ATACTTCCGC GACGAAAAAG ACGAAAACGG 650 
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CAAAGGCCGC GACGTATTGT 

TGGCCGGTAC CGAAGTATCC 

GGTTACCAAC CGACATTGGC 

TACCTCTACC CAAACCGGTT 



TCTTCGTGGA CAACATCTAC 
GCACTGTTGG GCCGTATGCC 
TGAAGAAATG GGTCGTTTGC 
CCATTACTTC C 



CGTTACACCC TOO 
TTCCGCAGTG 750 
AAGAGCGTAT 800 

831 



2) INFORMATION FOR SEQ ID NO: 754 



10 



15 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: 2241C 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 754 



CCACGCGACA 

2 5 CGGTCTGACT 

CCATTGCGAT 
AATACCGGTG 
CATTGTCGAT 
CCGACAAGAG 

3 0 TCTTCCACAA 

GTGTCCGTTT 
TGGGTAAAAC 
CACAGCGGCT 
TAACGACTTC 

3 5 CAATGGTTTA 

GCATTGACCG 
AAACGGCAAA 
ACACTCTAGC 
GCAGTGGGCT 

4 0 GCGTATTACC 



TGATTCCGCG 
TTGGAAGTCC 
GGGCAGCTCG 
CGCCCATTAC 
GTATTGGGAA 
CCGTGCCATC 
CCGAATTGCT 
GCCAAAGGCG 
CGTGAACATG 
TGTCCGTGTT 
TACCACGAGA 
CGGTCAGATG 
GTTTGACCAT 
GGCCGCGACG 
TGGTACCGAA 
ACCAACCGAC 
TCTACCCAAA 



CGTTTACGAC 
AACAGCTTTT 
GACGGTTTGA 
TGTGCCGGTA 
CGCCTGTTGA 
CACCAAGCCG 
CGAAACGGGC 
GTAAAGTAGG 
ATGGAATTGA 
CGCAGGCGTG 
TGAAAGATTC 
AACGAACCTC 
GGCGGAATAC 
TATTGTTCTT 
GTATCCGCAT 
ATTGGCAGAA 
CCGGTTCTAT 



GCTTTGAAAT 
GGGCGACGGC 
AACGCGGCAT 
GGTAAAGGTA 
CGAGGCAGGT 
CTCCTAAGTT 
ATTAAAGTGA 
TCTGTTCGGC 
TCAACAACAT 
GGTGAGCGTA 
CAACGTATTG 
CGGGCAACCG 
TTCCGTGACG 
CGTGGACAAC 
TGTTGGGCCG 
GAAATGGGTC 
TACTTC 



TAGACGAAAA 50 

GTAGTCCGTA 100 

GACTGTGAGC 150 

CGTTGGGACG 200 

CCAATCGATA 2 50 

TGACGAACTG 3 00 

TTGACTTGCT 350 

GGTGCCGGTG 4 00 

CGCCAAAGCG 450 

CCCGCGAAGG 50 0 

GATAAAGTGG 550 

TTTGCGCGTC 600 

AAAAAGACGA 650 

ATCTACCGTT 70 0 

TATGCCGTCT 750 

GTTTGCAGGA 80 0 

836 



45 



2) INFORMATION FOR SEQ ID NO: 755 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Neisseria mucosa 

(B) STRAIN: ATCC 19696 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755 

60 CCGCGTGATG CCATTCCGCG TGTTTACGAC GCCCTGAAAT TGGATGCAAA 50 
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CGGCCTGACT TTGGAAGTGC 
CTATTGCAAT GGGTAGTTCG 
AATACAGGTG CGCCGATTAC 
TATTGTCGAT GTATTGGGTA 
5 CCGACAAACA CCGTGCCATC 
TCTGCGACTA CTGAGCTGCT 
GTGTCCGTTT GCCAAAGGCG 
TAGGCAAAAC CGTCAACATG 
CATAGCGGTT TGTCCGTGTT 

10 TAATGACTTC TACCACGAGA 
CGATGGTTTA CGGTCAGATG 
GCCTTGACCG GTTTGACGAT 
AAGCGGCAAA GGCCGTGACG 
ACACCCTGGC CGGTACCGAA 

15 GCAGTAGGTT ACCAACCGAC 
GCGTATTACC TCTACCCAAA 



AACAGCTTCT GGGTGACGGC 
GATGGTTTGA AACGCGGCAT 
AGTACCGGTA GGTAAAGGTA 
CGCCTGTTGA CGAAGCAGGT 
CATCAGACAG CTCCGAAATT 
GGAAACCGGC ATTAAAGTGA 
GTAAAGTAGG TCTGTTCGGT 
ATGGAATTAA TTAACAACAT 
TGCAGGTGTG GGTGAGCGTA 
TGAAAGATTC CAACGTATTG 
AACGAACCTC CGGGTAACCG 
GGCCGAATAC TTCCGTGATG 
TATTGTTCTT CGTGGACAAT 
GTATCCGCAT TGTTGGGTCG 
ATTGGCTGAA GAAATGGGTC 
CAGGCTCCAT TACCTCC 



PCT/CA00/01150 

GTAGTTCGTA ^00 

GACTGTAAGC 150 

CTTTGGGACG 2 00 

CCGATTGATA 2 50 

TGATGAACTG 3 00 

TTGACTTGCT 3 50 

GGTGCCGGTG 4 00 

CGCCAAAGCA 4 50 

CCCGTGAAGG 500 

GACAAAGTGG 550 

TCTGCGTGTA 600 

AAAAAGACGA 650 

ATCTACCGTT 700 

TATGCCTTCA 750 

GTTTGCAAGA 800 

837 



20 2) INFORMATION FOR SEQ ID NO: 756 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sub f lava. 

(B) STRAIN: ATCC 14221 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 756 



CGCGACGCTA TCCCGCATGT TTACGATGCC CTGAAATTGG ACGAGAACGG 50 

TCTGACTCTG GAAGTTCAAC AGCTTCTGGG TGACGGCGTT GTCCGTACTA 100 

TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GCGGCATGTC TGTAAGCAAT 150 

ACTGGTGCGC CAATCACTGT GCCGGTAGGT AAAGGTACAT TGGGTCGTAT 2 00 

40 TGTCGACGTA TTGGGTACGC CTGTTGATGA AGCAGGTCCG ATCGATACCG 2 50 

ACAAGAGCCG TGCCATTCAC CAAACTGCTC CGAAATTCGA CGAGTTGTCT 3 00 

TCAACTACCG AATTGCTGGA AACCGGTATT AAAGTGATCG ACTTGCTGTG 3 50 

TCCGTTTGCT AAGGGCGGTA AAGTAGGTCT GTTCGGTGGT GCCGGTGTGG 4 00 

GCAAAACCGT GAACATGATG GAATTGATCA ACAACATCGC CAAAGCGCAC 4 50 

45 AGCGGTCTGT CCGTGTTCGC AGGTGTGGGT GAACGTACCC GTGAAGGTAA 50 0 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTATTGGAT AAAGTAGCCA 550 

TGGTGTATGG CCAAATGAAC GAACCTCCGG GCAACCGTCT GCGCGTTGCT 600 

TTGACCGGTT TGACTATGGC CGAATACTTC CGTGACGAAA AAGACGAAAA 650 

CGGTAAAGGT CGCGACGTAT TGTTCTTCGT TGACAACATC TACCGTTACA 700 

50 CTCTGGCCGG TACCGAAGTA TCTGCACTGT TGGGCCGTAT GCCTTCTGCA 750 

GTGGGTTACC AACCGACATT GGCTGAAGAA ATGGGTCGTT TGCAAGAGCG 8 00 

TATTACCTCT ACCCAAACTG GTTCCATTAC TTCC 834 



55 

2) INFORMATION FOR SEQ ID NO: 757 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 833 bases 
60 (B) TYPE: Nucleic acid 

431 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria, weaveri 

(B) STRAIN: ATCC 51223 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757 

CCCGTGATGC TATTCCCATG TATACGATGC CCTGAAATTG GTAGATAACG 50 

ATCTGACCCT GGAAGTGCAA CAACTTTTAG GTGATGGTGT GGTTCGTACC 100 

ATTGCAATGG GTAGTTCAGA CGGCCTAAAA CGTGGTATGG CTGTTAACAA 150 

15 TACCGGCGCT CCGATTACTG TTCCGGTGGG GAAAGCCACC TTGGGACGTA 2 00 

TTATGGATGT GTTGGGTAAT CCGGTTGATG AAGCAGGTCC TGTTGTATCA 250 

GAAGAAACTC GCGCTATTCA TCAAGCTGCC CCTAAATTTG ACGAACTGTC 300 

TTCAGCAACT GAGTTGTTGG AAACAGGCAT TAAAGTAATT GACTTGCTGT 350 

GCCCGTTTGC CAAAGGTGGT AAAGTAGGTT TGTTTGGTGG TGCCGGCGTG 4 00 

2 0 GGTAAAAC CG TAAATATGAT GGAGTTGATC AACAACATCG CGAAGGCACA 4 50 

TAGTGGTTTG TCTGTATTCG CCGGTGTAGG TGAGCGTACC CGTGAAGGTA 500 

ACGACTTCTA CCATGAAATG AAAGACTCTA ACGTATTGGA TAAAGTAGCC 550 

ATGGTTTATG GCCAGATGAA TGAACCTCCG GGTAACCGTT TGCGCGTTGC 600 

TTTGACTGGT TTGACTATGG CCGAATATTT CCGTGACGAG AAAGATGAAA 650 

2 5 ACGGCAAAGG TCGTGACGTC TTGTTCTTTG TGGATAATAT CTATCGCTAT 70 0 

ACTCTGGCCG GTACTGAAGT GTCTGCACTG TTAGGTCGTA TGCCGTCTGC 750 

AGTAGGTTAT CAGCCTACAT TGGCAGAAGA AATGGGTCGC TTGCAGGAGC 8 00 

GTATTACTTC TACTCAAACA GGTTCGATTA CTT 833 



30 



45 



2) INFORMATION FOR SEQ ID NO: 758 



(i) SEQUENCE CHARACTERISTICS: 

3 5 (A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria animalis 

(B) STRAIN: ATCC 19573 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 758 



CGCGATGCCA TTCCTCACGT TTACGATGCC CTGAAATTGG ACGACACCGG 50 

TCTGACTTTG GAAGTACAAC AACTTCTGGG CGACGGTGTG GTACGTACCA 100 

50 TTGCAATGGG TAGTTCAGAC GGCCTGAAAC GGGGTTTGTC TGTGAGCAAT 150 

ACCGGTTCTC CGATTGCCGT TCCTGTCGGT AAAGCGACTT TGGGTCGTAT 2 00 

TATGGACGTA TTGGGCAATC CGGTTGATGA AGCCGGTCCG GTTGCTACCG 2 50 

AAGAGAAACG TGCTATTCAC CAAGCCGCAC CGAAGTTTGA TGAATTGTCA 3 00 

TCAGCTACCG AGTTGTTGGA AACCGGTATT AAAGTAATCG ACTTGCTGTG 3 50 

55 TCCGTTTGCA AAAGGCGGTA AAGTAGGTCT GTTCGGCGGT GCCGGTGTGG 4 00 

GCAAAACCGT AAACATGATG GAATTGATCA ACAACATCGC CAAAGCACAC 450 

AGCGGTCTGT CTGTGTTTGC CGGTGTAGGT GAACGTACCC GCGAAGGTAA 500 

CGACTTCTAC CACGAGATGA AAGATTCCAA CGTGTTGGAT AAAGTAGCCA 550 

TGGTGTACGG TCAGATGAAT GAGCCGCCGG GCAACCGCTT GCGCGTGGCT 600 

60 TTGACCGGCC TGACTATGGC CGAATACTTC GTGACGAAAA AGACGAAAAC 650 

432 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GGCAAAGGTC GTGACGTATT 
ACTGGCCGGT ACCGAAGTAT 
TAGGTTATCA GCCGACATTG 
ATTACCTCTA CCCAAACCGG 



GTTCTTCGTG GACAACATTT 
CAGCATTGTT GGGCCGTATG 
GCAGAGGAAA TGGGTCGCTT 
TTCGATTACC TCT 



ACCGCTACAC 7 0O 

CCGTCTGCAG 750 
GCAAGAGCGT 800 

833 



2) INFORMATION FOR SEQ ID NO: 759 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus penneri 
20 (B) STRAIN: ATCC 33519 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 759 

AGCGTCCCTA AAGTATACGA CGCTCTTGAG GTTATGAATG GTAAAGAAAA 50 

25 ACTGGTGCTG GAAGTTCAGC AACAGTTAGG CGGTGGTATC GTTCGTTGTA 100 

TCGCAATGGG TACATCAGAC GGTTTAAGCC GTGGCTTAAA AGTTGAAGAT 150 

TTAGGCCACC CAATTGAAGT ACCAGTAGGT AAAGCAACAC TGGGACGTAT 2 00 

CATGAACGTT CTGGGTACAC CTATTGATAT GAAAGGTGAT ATTGCAACTG 2 50 

AAGAACGTTG GTCTATTCAC CGTGAAGCAC CAACCTACGA AGAGTTATCA 300 

3 0 AACTCACAAG AACTGCTTGA AACCGGTATC AAAGTAATGG ACTTAATCTG 3 50 

TCCGTTTGCT AAAGGTGGTA AAGTAGGTCT GTTCGGTGGT GCGGGTGTGG 4 00 

GTAAAACAGT TAACATGATG GAATTGATCC GTAATATCGC GATCGAGCAC 450 

TCAGGTTACT CTGTATTTGC TGGTGTTGGT GAGCGTACTC GTGAGGGTAA 500 

CGACTTCTAT CATGAAATGA CAGATTCTAA CGTTCTTGAC AAAGTATCGT 550 

3 5 TAGTTTATGG TCAGATGAAT GAGCCACCAG GAAACCGTCT GCGTGTAGCA 600 

CTGACTGGTC TGACTATGGC TGAAAAATTC CGTGATGAAG GCCGTGACGT 650 

ACTGTTATTC GTCGATAACA TTTATCGTTA CACCTTAGCC GGTACAGAAG 700 

TATCAGCACT GTTAGGTCGT ATGCCATCAG CGGTAGGTTA CCAGCCAACA 750 

TTGGCTGAAG AGATGGGTGT TCTGCAAGAA CGTATCACTT CAACCAAAAC 8 00 

4 0 AGGTTCAATC ACCTCTGTA 819 



45 



60 



2) INFORMATION FOR SEQ ID NO : 7 60 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 

serotype Enteritidis 
(B) STRAIN: ATCC 13076 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 760 

433 
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WO 01/23604 

GCCGTACCAC GCGTGTACGA 
GCTGGTGCTG GAAGTTCAGC 
TCGCGATGGG GTCTTCTGAC 
CTCGAACACC CGATCGAAGT 
5 CATGAACGTC CTGGGCGAAC 
AAGAGCGTTG GGCGATTCAC 
AACTCTCAGG AACTGCTGGA 
TCCGTTCGCG AAGGGCGGTA 
GTAAAACCGT AAACATGATG 

10 TCCGGTTACT CAGTGTTTGC 
CGACTTCTAC CACGAAATGA 
TGGTGTATGG CCAGATGAAC 
CTGACCGGCC TGACCATGGC 
ACTGCTGTTC GTCGATAACA 

15 TATCCGCACT GCTGGGCCGT 
CTGGCGGAAG AGATGGGCGT 
CGGTTCTATC ACCTCCGTA 



TGCCCTTGAG GTGCAGAATG 
AGCAGCTTGG CGGCGGTATC 
GGTCTGCGTC GCGGTCTGGA 
CCCGGTAGGT AAAGCCACGC 
CGGTCGACAT GAAAGGCGAG 
CGCGCAGCAC CTTCCTACGA 
AAC CGGTATC AAAGTTATCG 
AAGTCGGTCT GTTCGGTGGT 
GAGCTTATTC GTAACATCGC 
GGGCGTAGGG GAACGTACTC 
CCGACTCCAA CGTTATCGAT 
GAGCCGCCGG GAAACCGTCT 
GGAGAAATTC CGTGACGAAG 
TCTATCGTTA CACCCTGGCC 
ATGCCTTCCG CAGTAGGTTA 
TCTGCAGGAA CGTATCACCT 



PCT/CA00/01150 

gtaAtgagaa" -£0 

GTGCGTACCA 100 

TGTAAAAGAT 150 

TGGGTCGTAT 200 

ATCGGCGAAG 250 

AGAGTTGTCA 3 00 

ACCTGATGTG 350 

GCGGGTGTAG 400 

GATCGAGCAC 450 

GTGAGGGTAA 500 

AAAGTATCCC 550 

GCGCGTTGCA 600 

GTCGTGAGGT 650 

GGTACGGAAG 700 

CCAGCCGACT 750 

CCACCAAAAC 800 

819 



20 



30 



55 



2) INFORMATION FOR SEQ ID NO: 761 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia, pestis 

(B) STRAIN: KIM D2 7 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 761 

ACCAAAAGTG TACAACGCCC TTGAGGTAGA AGGTACAACT GAAAAGTTAG 50 

TGCTGGAAGT TCAGCAACAG TTGGGCGGTG GTGTTGTTCG TTGTATCGCA 10 0 

ATGGGCTCTT CCGATGGTCT GAGCCGTGGG TTGAAAGTAA CCAACCTAGA 150 

40 ACACCCGATC GAAGTACCGG TTGGTAAAGC GACCCTTGGC CGTATCATGA 2 00 

ACGTATTGGG TGAACCAATC GACATGAAAG GTCCTATCGG TGAAGAAGAG 250 

CGTTGGGCAA TCCATCGCGA AGCGCCTTCT TATGAAGAGC TTGCTAGCTC 300 

ACAAGATCTG TTAGAAACCG GTATCAAGGT TATGGACCTG ATTTGTCCGT 3 50 

TTGCTAAGGG CGGTAAAGTC GGTCTGTTCG GTGGTGCGGG TGTAGGTAAA 400 

45 ACAGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG AGCACTCTGG 450 

GTATTCTGTA TTTGCCGGTG TAGGTGAGCG TACCCGTGAG GGTAATGACT 500 

TCTACCATGA AATGACTGAC TCCAACGTTT TGGACAAAGT ATCCTTGGTT 550 

TACGGCCAGA TGAATGAGCC ACCAGGTAAC CGTCTACGCG TTGCACTGAC 600 

CGGCCTGACC ATGGCGGAGA AATTCCGTGA TGAAGGTCGT GACGTACTGC 650 

50 TGTTTATCGA TAATATCTAT CGTTATACCC TAGCTGGTAC GGAAGTATCC 700 

GCATTGCTGG GTCGTATGCC ATCAGCGGTA GGTTATCAGC CAACACTGGC 750 

TGAAGAGATG GGTGTGTTGC AGGAACGTAT TACTTCCACT AAGACGGGTT 800 

CAATCACCTC TG 812 



2) INFORMATION FOR SEQ ID NO: 762 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 4 08 bases 

434 



SUBSTITUTE SHEET (RULE 26) 



10 



WO 01/23604 PCT/CA00/01 150 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Burkholderia mallei 

(B) STRAIN: GB8 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762 



CGACGTGATC GAGCCCTTCT TCGTCGACGT GATGCGCTCT TGCAGCTTGC 50 

CCATTTCTTC AGCCAGCGTC GGCTGATAGC CCACTGCCGA CGGCATACGG 100 

15 CCGAGCAGCG CCGACACTTC GGTACCGGCC AGCGTGAAAC GGTAGATGTT 150 

GTCGACGAAG AACAGCACGT CGAGGCCTTC GTCACGGAAG TGCTCGGCCA 2 00 

TCGTGAGGCC CGTCAGCGCC ACGCGCAGAC GGTTGCCCGG CGGCTCGTTC 2 50 

ATCTGGCCGT ACACCAGCGC GACCTTGTCG AGAACGTTCG AGTCCTTCAT 3 00 

TTCGTGGTAG AAGTCGTTCC CTTCACGGGT ACGCTCGCCC ACGCCCGCGA 3 50 

20 ACACGGAGTA ACCGCCGTGC TCCTTCGCGA TGTTGTTGAT GAGCTCCATC 4 00 

ATGTTGAC 4 08 



25 2) INFORMATION FOR SEQ ID NO: 763 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium sordellii 

(B) STRAIN: ATCC 9714 



40 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 763 



GAACTTATAA ACAACATAGC TACTCAACAT GGTGGTATAT CAGTATTCGC 50 

AGGTGTTGGA GAGAGAACAA GAGAAGGTAA CGACCTTTAC GGAGAAATGA 100 

GTGAGTCTGG AGTTATAAAC AAGACAGCTC TAGTATTCGG ACAAATGAAT 150 

GAGCCACCTG GAGCAAGAAT GAGAGTTGCT TTAACTGGTC TTACAATGGC 2 00 

45 TGAATATTTC AGAGATCAAG AAGGACAAGA CGTTTTATTA TTCGTAGATA 2 50 

ATATATTCCG TTTCACTCAA GCAGGATCTG AGGTTTCTGC ACTTCTTGGA 3 00 

CGTACTCCAT CAGCAGTTGG ATACCAACCA ACATTAGCTA CAGAGATGGG 3 50 

TAGATTACAA GAGAGAATAA CATCTACAAA TAAAGGGTCT ATAACATCAG 4 00 



2) INFORMATION FOR SEQ ID NO: 764 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 4 05 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: Genomic DNA 

435 
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PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridia 

(B) STRAIN: ATCC 194 02 

(xi) SEQUENCE DESCRIPTION: SEQ 

TTAATTCAAG AATTAATCAA CAATATAGCG 
TGTATTTACA GGAGTTGGAG AAAGAACAAG 
ATGAAATGAA AGAATCTGGT GTTATAAATA 
CAAATGAATG AGCCACCTGG AGCAAGAATG 
TACTATGGCA GAATATTTCA GGGATCAAGG 
TAGACAACAT ATTTAGATTC ACTCAAGCAG 
CTTGGAAGAA TACCTAGTGC CGTTGGATAC 
AATGGGTGCG CTTCAAGAAA GAATAACATC 
CATCA 



novyi 



ID NO: 764 

AAGGAACACG GTGGATTATC 50 

AGAAGGTAAT GACCTTTACT 100 

AAACAGCACT AGTATTTGGT 150 

AGAGTTGCTC TTACAGGACT 2 00 

ACAAAACGTA CTTTTATTCA 250 

GTTCAGAGGT GTCAGCTTTA 3 00 

CAACCAACAC TAGCAACAGA 350 

TACAAAGCAT GGTTCTATAA 4 00 

405 



2) INFORMATION FOR SEQ ID NO: 765 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 

(B) STRAIN: 20:3.1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765 

AATTAATAAA TAACATAGCT AAAGAACATG GTGGATTATC AGTGTTTACT 50 

GGAGTTGGAG AAAGATCTAG AGAAGGTAAT GATCTTTATC ATGAAATGAG 100 

AGAATCAGGC GTTATAGATA AAACCGCATT AGTATTTGGT CAAATGAATG 150 

AGCCACCTGG TGCTAGAATG AGAGTTGCAT TAACAGGGTT AACTATGGCT 200 

GAATATTTTA GAGATAAAGG TCAAGATGTA TTACTATTTA TAGATAATAT 2 50 

ATTTAGATTT ACTCAAGCTG GATCAGAAGT TTCAGCATTA CTTGGAAGAA 300 

TACCATCAGC AGTTGGTTAC CAACCAACTT TAGCAACTGA AATGGGTGCA 3 50 

CTTCAAGAGA GAATTACATC AACTAAGAAT GGTTCTATAA CAT 393 



2) INFORMATION FOR SEQ ID NO: 766 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 99 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium histolyticum 

(B) STRAIN: ATCC 19401 

436 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 766 

AGGAACTTAT AAATAATATT GCAAAACAAT ATGGAGGTCT ATCTGTATTT 50 

ACAGGTGTTG GTGAAAGAAC AAGAGAAGGT AATGACCTAT ATAATGAAAT 100 

5 GAAAGATTCT GGGGTTATAG AGAAAACTGC ACTAGTATTT GGACAGATGA 150 

ATGAACCACC AGGAGCGAGA ATGAGAGTAG CATTGACAGG ACTTACTATG 2 00 

GCAGAATATT TTAGAGATCA AGGGCAAGAT GTACTTTTAT TTATAGATAA 250 

TATATTTAGA TTTACGCAGG CTGGTTCTGA AGTTTCTGCA TTGTTAGGAA 300 

GAATTCCAAG TGCAGTTGGA TATCAACCAA CCCTTGCAAC TGAAATGGGT 3 50 

10 GCATTACAAG AAAGAATAAC ATCCACAAAA AATGGATCAA TTACTTCAG 3 99 



2) INFORMATION FOR SEQ ID NO: 767 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 90 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Peptostreptococcus prevotii 

(B) STRAIN: ATCC 9321 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 767 

3 0 CATGATGGAA CTGATCCGTA ACATCGCCAT CGAGCACAGC GGTTATTCCG 50 

TGTTCGCCGG TGTGGGTGAG CGTACTCGTG AGGGTAACGA CTTCTACCAC 100 

GAGATGAAGG ACTCCAACGT TCTGGACAAA GTGGCACTGG TCTACGGTCA 150 

GATGAACGAG CCGCCGGGTA ACCGTCTGCG CGTAGCACTG ACTGGCCTGA 2 00 

CCATGGCCGA GAAGTTCCGT GACGAAGGTA ACGACGTTCT GTTGTTCGTC 2 50 

35 GACAACATCT ACCGTTACAC CCTGGCCGGT ACTGAAGTAT CCGCACTGCT 3 00 

GGGCCGTATG CCTTCGGCAG TAGGTTACCA GCCGACCCTG GCTGAAGAGA 3 50 

TGGGCGTTCT GCAAGAACGT ATCACTTCGA CCAAGGAAGG 3 90 



40 



50 



2) INFORMATION FOR SEQ ID NO: 768 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 953 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Absidia corymbifera 

(B) STRAIN: ATCC 46775 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 768 

AGGTCTTGTT CGTGGCAAAA GGTCATTGAC ACTGGTGCTC CTATCACCAT 50 

TCCTGTTGGT AACGAAGTCC TTGGTCGTAT CATTAACGTC ATTGGTGAGC 100 

CCATTGATGA GCGTGGTCCY ATCAAGTCCA AGGCCACTCG TGCTATCCAC 150 

60 GCTGATGCTC CCGAGTTCGT TGATCAATCC CCCACTCCCG AGATTCTCGA 2 00 

437 
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GACTGGTATC AAGGTTGTCG ATTTGCTTGC 
AGATTGGTCT TTTCGGTGGT GCCGGTGTCG 
GAATTGATCA ACAACATTGC CAAGGCTCAC 
TGGTGTCGGT GAACGTACTC GTGAAGGCAA 
5 TTTCCACTGG TGTCATCAAG CTTGAAGGTG 
TTTGGTCAAA TGAACGAACC CCCCGGAGCT 
TGGTTTGACC ATTGCCGAAT ACTTCCGTGA 
TGCTCTTCAT TGACAACATT TTCCGTTTCA 
TCCGCTTTGC TTGGTCGTAT TCCCTCTGCT 

10 YTCCACTGAT ATGGGTGGTA TGCAAGAGCG 
GTTCCATTAC CTCCGTGCAA GCTGTCTACG 
GATCCTGCTC CTGCTACTAC TTTTGCTCAC 
GTCTCGTTCC ATTGCTGAGT TGGGTATCTA 
ACTCCAAGTC CCGTATCTTG GATCCCCGTA 

15 TCTGTTGCCA CTGGTGTCCA ACAAATCCTT 
AGA 



PCT/CA00/01150 

TCCCTATGCT CGTGGtGGTA 25/5 

GCAAGACTGT CTTGATTCAA 3 00 

GGTGGTTACT CTATCTTCTG 350 

CGATTTGTAC CACGAAATGA 400 

AATCCAAGTG TGCTCTTGTC 45 0 

CGTGCCCGTG TTGCCTTGAC 500 

TGAGGAAGGT CAAGATGTGT 550 

CTCAAGCCGG TTCTGAAGTG 60 0 

GTCGGTTACC AACCCACTCT 650 

TATTACTACC ACCAAGAACG 70 0 

TCCCTGCTGA CGATTTGACT 750 

TTGGACGCCA CCACTGTGTT 800 

CCCTGCTGTC GATCCCCTTG 850 

TCGTTGGTGA TGAGCACTAC 900 

CAAAACTACA AGTCGCTTCA 95 0 

953 



20 2) INFORMATION FOR SEQ ID NO: 769 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1343 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alternaria alternata 

(B) STRAIN: ATCC 62099 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 769 



CGCGGTTCCA AGGCCACCGA CACCGGTGCC CCCATCAAGA TTC CCGTTGG 50 

TCACGGTACC CTTGGTCGTA TCATGAACGT CACTGGTGAC CCCATTGACG 10 0 

AGCGTGGTCC CATCAAGGCC ACCAAGTACG CTCCCATCCA CGCCGACCCC 150 

CCGGAGTTCA CCGAGCAATC CACCTCCGCT GAGGTCCTCG TTACCGGTAT 2 00 

40 CAAGGTTGTC GACCTGTTGG CTCCTTACGC TCGTGGTGGA AAGATTGGTC 250 

TCTTCGGAGG TGCTGGTGTC GGAAAGACTG TCTTCATTCA GGAGCTGATT 300 

GTAAGGAGAC ACACTGTCTA CTGGCTGAGC ATTAGCTAAC GGCAGGCAGA 3 50 

ACAACATCGC CAAGGCCCAC GGTGGTTTCT CTGTCTTCAC TGGTGTCGGT 400 

GAGCGTACCC GTGAGGGTAA CGATCTGTAC CACGAGATGC AGGAGACTTC 450 

45 CGTCATTCAG CTTGACGGTG ACTCCAAGGT CGCCCTCGTC TTCGGTCAGA 50 0 

TGAACGAGCC CCCGGGTGCC CGTGCCCGTG TCGCTCTTAC TGGTCTTACT 550 

GTTGCTGAGT AAGTCTTGAA TTCACTGTGT TGACAACGTC GTGGCTAATG 600 

GGAAAAAAGA TACTTCCGTG ACGAAGAGGG ACAGGATGTG CTTCTCTTCA 650 

TCGACAGTAA GTGCTTGTAC GAACTGCCTG TGAGACATAC ACTGACTTCG 70 0 

50 GCAATAGACA TTTTCCGTTT CACCCAGGCC GGTTCCGAGG TGTCCGCTCT 750 

TCTTGGTCGT ATCCCCTCTG CCGTCGGTTA CCAGCCCACC CTCGCCATTG 800 

ACATGGGTGT CATGCAGGAG CGTATTACCA CCACCACCAA GGGTTCCATC 850 

ACCTCCGTCC AGGCCGTCTA CGTGCCCGCT GACGATTTGA CTGACCCTGC 900 

CCCCGCCACC ACCTTCGCCC ATTTGGACGC CACCACTGTC TTGTCCCGTG 950 

55 GTATCTCCGA GTTGGGTATC TACCCCGCCG TCGACCCTCT TGACTCCAAG 1000 

TCCCGTATGT TGGACCCCCG TGTCATTGGT CAGGACCACT ACGACACCGC 1050 

CACCCGCGTT CAGCAGATTC TCCAGGAGTA CAAGTCGCTC CAGGATATCA 110 0 

TTGCCATTCT CGGTATGGAC GAGTTGTCGG AAGCTGACAA GCTTACCGTC 1150 

GAGCGTGCCC GTAAGATCCA GCGTTTCTTG AGCCAGCCTT TCGCTGTCGC 1200 

60 CCAGGTCTTC ACTGGTATTG AGGGCAAGCT TGTCGACCTC AAGGACACCA 1250 
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TCCGATCATT CAAGGCTATC TTGACTGGTG AGGGTGACGA CCTTCCCGAG 
GGTGAGTCTC GACTATCTCC GCATTCATAG CGTATAACTG ACA 



2) INFORMATION FOR SEQ ID NO: 770 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus 

(B) STRAIN: ATCC 2 6947 

(xi) SEQUENCE DESCRIPTION: SEQ 

TTCAGGAGTT GATTGTATGT TCACCTGCAA 
ACTCTTTTCT AACTCTTCAC AGAACAACAT 
ACTCTGTCTT CACTGGTGTC GGTGAGCGTA 
TACCACGAAA TGCAGGAGAC TGGTGTCATT 
GGTCGCCCTT GTCTTCGGTC AGATGAACGA 
GTGTCGCCCT TACCGGTCTG ACCATCGCCG 
GGTCAGGATG TGCTGCTCTT CATTGACAAC 
CGGTTCTGAG GTGTCTGCCC TTCTTGGTCG 
ACCAGCCCAC TCTGGCCGTC GACATGGGTG 
ACCACCACCA AGGGTTCCAT TACCTCCGTC 



flavus 



ID NO: 770 

CATAAGACTT CCCATTCTCC 50 

TGCCAAGGCT CACGGTGGTT 100 

CCCGTGAGGG TAACGATCTG 150 

CAGCTCGAGG GTGAATCTAA 2 00 

GCCCCCAGGT GCCCGTGCCC 2 50 

AGTACTTCCG TGACGAGGAA 3 00 

ATTTTCCGTT TCACCCAGGC 3 50 

TATCCCCTCC GCTGTCGGTT 4 00 

GTATGCAGGA GCGTATTACC 450 

480 



2) INFORMATION FOR SEQ ID NO: 771 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1174 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mucor circinelloides 

(B) STRAIN: ATCC 38592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771 

CTCGAACAAG AYAACTTGCC YGCCATTTTG AACGCCCTTG AAGTCAAGGA 50 

YCACTCTGGT GGACGTCTCG TYCTCGAAGT CTCTCAACAT TTGGGTGAGA 100 

ACACTGTCCG TACTATTGCT ATGGATGGTA CTGAAGGTAA GTTATGTYCA 150 

TCCCANNGGA TACAGTCARA CAGMAATGTC TAGTGGTTAT AGCAGYAGCA 2 00 

GATGATTGAC CAATATGTTA GGTCTTGTCC GGTGGTCAAA AGGTTGTTGA 250 

CACTGGTGCT CCCATTACCA TCCCCGTTGG TAAGGAAGTC CTTGGTCGTA 3 00 

TCATCAACGT TATTGGTGAA CCCATTGATG AACGTGGTCC CATTGACGCC 350 

AAGACTCACC GTCCTATTCA CGCTGAAGCT CCCGAATTCG TTGATCAATC 4 00 

CCCCACTCCC GAAATCCTCG AGACTGGTAT CAAGGTYGTC GATTTGTTGG 450 

CTCCTTATGC TCGTGGTGGT AAGATTGGTC TCTTCGGTGG TGCTGGTGTC 500 

439 
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GGTAAGACTG TCTTGATTCA 
CGGTGGTTAC TCTATCTTCT 
ACGATTTGTA CCATGAAATG 
GAGTCCAAGT GTGCTCTCGT 
5 TCGTGCCCGT GTCGCTTTGA 
ATGATGAGGG TCAAGATGTC 
ACTCAAGCTG GTTCTGAGGT 
TGTCGGTTAC CAACCCACTC 
GTATTACTAC CACCAAGAAC 
10 GTCCCTGCTG ATGATTTGAC 
CTTGGATGCC ACCACTGTCT 
AYCCCGCTGT CGATCCTCTT 
ATTGTCGGTG ATGAGCACTA 
CCAAAACTAC AAGTCTCTCC 

15 



AGAATTGATT AACAACATYG 
GTGGTGTCGG TGAACGTACT 
ATTGAAACCG GTGTCATTCA 
CTTTGGTCAA ATGAACGAAC 
CTGGTTTGAC TATTGCTGAA 
TTGCTTTTCA TTGATAACAT 
ATCTGCCCTT TTGGGTCGTA 
TTTCCACYGA TATGGGTGGT 
GGTTCCATTA CCTCCGTCCA 
CGATCCTGCT CCTGCCACCA 
TGTCTCGTTC CATCGCTGAA 
GATTCCAAGT CTCGTATCCT 
CAAGGTTGCC ACTGAAGTTC 
AAGA 



PCT/CA00/01150 

CCAAGGCTCA 550 

CGTGAGGGTA 600 

ATTGGAAGGC 650 

CCCCAGGTGC 700 

TACTTCCGTG 750 

TTTCAGATTC 80 0 

TTCCTTCCGC 850 

ATGCAAGAGC 90 0 

AGCTGTCTAC 950 

CTTTTGCTCA 1000 

TTGGGTATCT 1050 

CGATCCCCGT 1100 

AACAAATCCT 1150 
1174 



2 ) INFORMATION FOR SEQ ID NO: 772 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 67 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 

3 0 (B) STRAIN: ATCC 242 92 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772 

AGGAGCTTAT CAACAACATC GCCAAGGCTC ACGGTGGTTA CTCCGTCTTC 50 

35 ACTGGTGTCG GTGAGCGTAC TCGTGAGGGT AACGATTTGT ACCACGAGAT 100 

GCAAGAGACT TCCGTCATTC AGCTTGACSG CGAGTCCAAG GTCGCTCTCG 150 

TGTTCGGTCA GATGAACGAG CCCCCGGGTG CCCGTGCCCG TGTTGCCCTG 2 00 

ACTGGTCTTA CCATCGCTGA GTACTTCCGT GATGCCGAGG GTCAGGATGG 250 

TAAGTTCTAT AACTCTTGTC GCAAAGGTTT CATTCTGGTC GCTAACTTGC 3 00 

4 0 TCAGTGCTCC TGTTCATCGA CAACATTTTC CGTTTCACCC AGGCTGGTAT 3 50 

GGAGGTGTCT GCCCTCCTCG GTCGTATTCC TTCTGCCGTC GGTTACCAAC 400 

CTACTCTCGC CGTCGACATG GGTGGTATGC AAGAGCGTAT TACCACTACC 450 

AAGAAGGGAT CCATTAC 467 



2) INFORMATION FOR SEQ ID NO: 773 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 578 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudallescheria boydii 

(B) STRAIN: ATCC 44331 

60 

440 
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(xi) SEQUENCE DESCRIPTION: SEQ 

TTGCCAAGGC CCACGGTGGT TACTCTGTCT 

ACCCGTGAGG GTAACGATCT GTACCACGAA 

5 TCAGCTTGAT GGCGAGTCCA AGGTCGCGCT 

AGCCCCCTGG TGCCCGTGCT CGTGTCGCTC 

GAGTACTTCC GTGATGAGGA GGGTCAGGAT 

TTATCTTCTT TGCCACCACC CCTCTACGAA 

GGCATCGTTT GTAGGGCGGG TCGGAGTTTG 

10 TTGAAGCCGC GGATGCCCGA TGTTTGACGC 

GACAACAGTG CTTCTCTTCA TTGACAACAT 

GTTCCGAGGT GTCTGCCCTT CTCGGTCGTA 

CAGCCCACTC TTGCCGTAGA CATGGGTGCC 

CACCAAGAAG GGTTCGATTA CCTCCGTC 

15 



PCT/CA00/01150 

ID NO: 773 

TCACTGGTGT CGGTGAGCGT 5 0 

ATGCAGGAAA CCTCCGTCAT 10 0 

TGTCTTCGGT CAGATGAACG 150 

TTACTGGTCT TACCGTTGCC 2 00 

GGTAAGTTAT ATCGTTTTTA 250 

TCCATGCCTC CGTTGGTGAA 3 00 

CGGCAATTTC TGCCGTCGGC 350 

GTATCGATGC TAACAACAAT 4 00 

TTTCCGATTC ACCCAGGCCG 450 

TTCCCTCTGC CGTCGGTTAC 50 0 

ATGCAGGAAC GTATTACCAC 550 

578 



2) INFORMATION FOR SEQ ID NO: 774 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1123 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rhizopus oryzae 
30 (B) STRAIN: ATCC 56015 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774 

AACTTACCYG CTATCTTGAA CGCTCTCGAA GTCCAAGATC ACTCTGGTGG 50 

3 5 ACGTCTTGTC CTTGAAGTTC GCTCAACACT TGGGTGAAAA TACTGTCCGT 100 

ACTATTGCTA TGGATGGTAC TGAAGGTAAG CTATACTATA ACCGTKTTAT 150 

CCGAGTATGA TATTAACTTG AAAAAAGGTC TCGTYCGTGG TCAAAAGGTT 2 00 

ATTGACACTG GTGCTCCCAT TACCATTCCT GYTGGTAAGG AAGTTCTCGG 2 50 

TCGTATCATT AACGTCATTG GTGAACCCAT CGATGAACGT GGTCCTATCA 3 00 

4 0 ACGCCAAGAG CCAACGTCCC ATTCACGCCG AAGCTCCCGA ATTCGTTGAC 350 

CAATCTCCTA CTCCCGAAAT TCTTGAAACT GGTATCAAGG TTGTCGACTT 400 

GTTGGCTCCT TATGCTCGTG GTGGTAAGAT TGGTCTTTTC GGTGGTGCTG 450 

GTGTCGGTAA GACTGTGTTG ATTCAAGAAT TGATTAACAA CATCGCCAAG 500 

GCTCACGGTG GTTACTCTAT TTTCTGTGGT GTCGGTGAAC GTACTCGTGA 550 

4 5 AGGTAACGAT CTTTACCACG AAATGATTGA AACTGGTGTC ATCAAGCTCG 600 

ATGGTGACTC CAAGTGTGCT CTTGTCTTTG GTCAAATGAA CGAACCCCCA 650 

GGAGCTCGTG CCCGTGTTGC CTTGACTGGT TTGACCATTG CTGAATACTT 700 

CCGTGATGAT GAAGGTCAAG ATGTGTTGCT TTTCATTGAT AACATTTTCC 750 

GTTTCACCCA AGCTGGTTCW GAAGTATCTG CCCTTTTGGG TCGTATTCCC 800 

50 TCCGCTGTCG GTTACCAACC CACTCTTTCT ACTGATATGG GTGGTATGCA 850 

AGAACGTATT ACAACCACCA AGAACGGTTC CATTACATCC GTCCAAGCTG 900 

TCTACGTCCC TGCTGATGAT TTGACCGATC CTGCTCCCGC CACCACTTTT 950 

GCTCACTTGG ATGCCACCAC TGTCTTGTCT CGTTCCATTG CCGAATTGGG 1000 

TATTTACCCT GCCGTCGATC CTCTTGAYTC CAAGTCTCGT ATCTTGGATC 1050 

55 CTCGTATCGT TGGTGACGAA CACTACAAGG TCGCTACCGA AGTTCAACAA 1100 

ATCCTTCAAA ACTACAAGTC TCT 1123 



60 2) INFORMATION FOR SEQ ID NO: 775 

441 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Scopulariopsis koningii 

(B) STRAIN: ATCC 38745 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 775 



ATTCAGGAGC TCATCAACAA CATTGCCAAG GCTCACGGTG GTTACTCTGT 50 

GTTCACTGGT GTCGGTGAGC GTACCCGTGA GGGTAACGAT CTGTACCACG 100 

AAATGCAGGA GACTTCGGTC ATTCAGCTCG AGGGCGAGTC CAAGGTCGCG 150 

CTTGTGTTCG GTCAGATGAA CGAGCCCCCC GGTGCCCGTG CCCGTGTCGC 2 00 

2 0 CCTTACCGGT CTGACCGTTG CCGAGTACTT CCGTGACGAG GAGGGCCAGG 2 50 

ATGGTGAGTA ACCGACGAAG TCTGAGATCT TGTCGGGCAT TATTCTAACG 3 00 

ACAACTAGTG CTTCTCTTCA TCGACAACAT TTTCCGCTTC ACCCAGGCCG 3 50 

GTTCCGAGGT GTCCGCGCTT CTCGGCCGTA TCCCCTCTGC CGTCGGTTAC 4 00 

CAGCCCACCC TGGCCGTCGA CATGGGAGGT ATGCAGGAGC GTATTACCAC 4 50 

25 GACTCAGAAG GGCTCGATTA CCTCGGT 477 



30 



2) INFORMATION FOR SEQ ID NO: 776 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 610 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 776 

45 TCCGGAGTTG ATTGTAAGTC ATTTGAAACC CAGCCCCAAG AAACAGAAGC 50 

TAGGCGAAAA TTGGACAATT GAGCAATTTA GCCATTGGAG AAAAGAAATT 100 

TCGAGTATTA ATTGTTTTTA TAGAACAACA TTGCCAAGGC TCACGGTGGT 150 

TACTCTGTCT TCACTGGTGT CGGAGAGCGT ACCCGTGAAG GAAACGATCT 2 00 

CTACCATGAA ATGCAGGAGA CCCGTGTCAT TCAGCTTGAT GGCGAGTCCA 2 50 

50 AGGTCGCCCT GGTCTTCGGC CAGATGAACG AGCCCCCAGG TGCCCGTGCC 3 00 

CGTGTTGCTC TTACTGGTTT GACCATTGCT GAGTACTTCC GTGATGAGGA 3 50 

AGGTCAAGAC GGTGAGTTTY TTATGGATAA AARATTTTTT TTTTTTTTTT 4 00 

TTTTTTMAAR AAATTCATGT TCTAACAAAG TATATCCTAG TGCTTCTCTT 4 50 

CATCGACAAC ATTTTCCGTT TCACTCAGGC TGGTTCCGAA GTGTCTGCCC 500 

55 TGCTTGGTCG TATCCCATCT GCCGTCGGTT ACCAACCCAC TCTTGCCGTC 550 

GACATGGGTG GTATGCAGGA ACGTATTACC ACCACCAAGA AGGGATCCAT 600 

TACCTCCGTC 610 



60 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 777 

GGAGTTGATT GTAAGTCATT TGAAACCCAG CCCCAAGAAA CAGAAGCTAG 50 

GTGAAAATTG GACAATTGAA CAATTTAGCC CTTGGAGAAA AGAAATTTCG 100 

AGTATTAATT ATTTTTATAG AACAACATTG CCAAGGCTCA CGGTGGTTAC 150 

20 TCTGTCTTCA CTGGTGTCGG AGAGCGTACC CGTGAAGGAA ACGATCTCTA 2 00 

CCATGAAATG CAGGAGACCC GTGTCATTCA GCTTGATGGC GAGTCCAAGG 2 50 

TCGCCCTGGT CTTCGGCCAG ATGAACGAGC CCCCAGGTGC CCGTGCCCGT 3 00 

GTTGCTCTTA CTGGTTTGAC CATTGCTGAG TACTTCCGTG ATGAGGAAGG 3 50 

TCAAGACGGT GAGTTTCTTA TGGATGAAAG ATTTTTTTTT TTCAAGAAAT 4 00 

25 TCATGTTCTA ACAAAGTGTA TTCTAGTGCT TCTCTTCATC GACAACATTT 450 

TCCGTTTCAC TCAGGCTGGT TCCGAAGTGT CTGCCCTGCT TGGTCGTATC 500 

CCATCTGCCG TCGGTTACCA ACCCACTCTT GCCGTCGACA TGGGTGGTAT 550 

GCAGGAACGT ATTACCACCA CCAAGAAGGG ATCCATTACC TCC 593 



30 



45 



2) INFORMATION FOR SEQ ID NO: 778 



(i) SEQUENCE CHARACTERISTICS: 
3 5 (A) LENGTH: 1141 bases 

(B) TYPE: Nucleic acid 
•(C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichosporon cutaneum 

(B) STRAIN: ATCC 62965 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778 



CCGTGGTCAA GAAGTTATTG ACACTGGTGC CCCAATTACC ATTCCTGTTG 50 

GTCGTGGTAC TCTTGGTAGA ATTATCAACG TCATTGGTGA ACCAATTGAC 100 

50 GAACGTGGCC CTATCAAGGC TTCTAAGTAT GCTCCTATCC ATACTGAACC 150 

ACCAACCTTT GCTGAACAAT CTACTTCTGC TGAAGTTCTT GAAACCGGTA 2 00 

TCAAGGTTGT CGATCTTCTT GCTCCTTACG CCCGTGGTGG TAAGATTGGT 250 

CTTTTCGGTG GTGCTGGTGT CGGTAAGACT GTCTTCATTC AAGAACTTAT 3 00 

TAACAACATT GCCAAGGCTC ACGGTGGTTT CTCTGTCTTC ACTGGTGTCG 350 

55 GTGAAAGAAC CCGTGAAGGT AACGATCTTT ACCGTGAAAT GAAGGAAACT 4 00 

GGTGTCATCA ACCTCGAAGG TGACTCCAAG GTCGCTCTCG TTTTCGGTCA 450 

AATGAACGAA CCTCCAGGTG CCCGTGCCCG TGTCGCTTTG ACTGGTCTTA 500 

CCATTGCCGA ATACTTCCGT GATGAAGAAG GACAAGATGT CTTGCTTTTC 550 

GTTGACAACA TTTTCAGATT CACCCAAGCC GGTTCTGAAG TCTCTGCTCT 600 

60 TTTGGGTCGT ATTCCATCTG CCGTCGGTTA CCAACCTACC CTTGCTACCG 650 

443 
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ATATGGGTGC CCTCCAAGAA 

ACATCTGTCC AAGCCGTCTA 

CCCAGCCACC ACTTTCGCTC 

CCATTTCCGA ATTGGGTATC 

5 TCTCGTCTTT TGGATCCTGA 

CACTCAAGTT CAACAAACCC 

TTGCCATTTT GGGTATGGAT 

GAACGTGCCC GTAAGATCCA 

CGAAGTTTTC ACTGGTATCG 

10 TCAGATCTTT CAAGGAAATC 



CGTATTACCA CCACCCAAAA 
TGTCCCAGCA GACGATTTGA 
ACTTGGACGC CACCACTGTC 
TACCCAGCTG TCGATCCTCT 
AGTTATTGGA CACGAACACT 
TCCAAGCTTA CAAGTCTCTC 
GAATTGTCTG AAGCTGATAA 
AAGATTCCTT TCCCAACCAT 
AAGGTCGTCT CGTTCCATTG 
CTTGAAGGTA AGTACGATCA 



PCT/CA00/01150 



gggttCcgtc ' 


70^ 


CTGATCCTGC 


750 


TTGTCTCGTT 


800 


CGATTCCAAG 


850 


ACGAAGTTGC 


900 


CAAGATATCA 


950 


GCTTACTGTC 


1000 


TCGCTGTTGC 


1050 


AAGGAAACCG 


1100 


C 


1141 



2) INFORMATION FOR SEQ ID NO: 779 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Cladophialophora carrionii 

(B) STRAIN: ATCC 22864 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 779 

3 0 GCTGAAGGCC GAACGTGAGC GTGGTATCAC CATCGATATC GCGCTCTGGA 50 

AGTTCGAGAC TCCCAAGTAC TTCGTCACCG TCATCGATGC CCCTGGTCAT 100 

CGTGACTTCA TCAAGAACAT GATCACTGGT ACCTCCCAGG CTGATTGTGC 150 

TATTCTCATC ATTGCCGCTG GTACTGGTGA GTTCGAGGCC GGTATCTCCA 200 

AGGATGGCCA GACCCGTGAG CATGCTCTGC TCGCCTACAC CCTGGGCGTG 250 

3 5 AAGCAGCTTA TCGTCGCCAT CAACAAGATG GACACCACCA AATGGTCTGA 3 00 

GGATCGTTTC AACGAAATCA TCAAGGAGAC TTCCAACTTC ATCAAGAAGG 350 

TCGGATACAA CCCCAAGTCC GTTCCATTCG TGCCCATCTC CGGCTTCAAC 400 

GGTGACAACA TGATCGACGT CTCCACCAAT GCCCCCTGGT ACAAGGGCTG 450 

GGAAAAGGAG TCCAAGGCTG GCAAGGCCAC CGGCAAGACC CTCCTTGAGG 500 

4 0 CTATCGACTC CATCGACCCT CCTGCTCGTC CCACCGACAA GCCTCTCCGT 550 

CTCCCACTCC AGGATGTCTA CAAGATTTCT GGTATCGGCA CGGTGCCCGT 6 00 

CGGTCGTGTT GAGACTGGTA C CAT C AAGG C CGGTATGGTC GTCACCTTTG 650 

CCCCCGCCAA CGTCACCACT GAAGTCAAGT CCGTCGAAAT GCACCACGAA 700 

CAGCTCGCCG AGGGCGTTCC GGGTGACAAC GTCGGCTTCA ACGTCAAGAA 7 50 

45 CGTCTCCGTG AAGGAGGTTC GTCGTGGAAA CGTTGCTGGT GACTCCAAGA 800 

ACGACCCCCC CAAGGGTGCC GACTCCTTCA ACGCCCAGGT CATCGTCCTC 850 

AACCACCCTG GTCAGGTCGG TGCTGGCTAC GCCCCGGTCT TGGATTGCCA 900 

CACTGCCCAC ATTGCCTGCA AGTTCTCTGA GCTCCTCGAG AAGATCGATC 950 

GTCGTACCGG CAAGTCCATG GAAAACAACC CCAAGTTCAT CAAGTCTGGT 10 00 

50 GATGCTGCCA TCGTGAAGAT GGTTCCCAGC AAGCCTATGT GCGTTGAGGC 1050 

TTTCACCGAC TACCCTCCTC TTGGTCGTTT CGCCGTCCGT GAC 1093 



55 2) INFORMATION FOR SEQ ID NO: 780 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Double 

444 
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(D) TOPOLOGY: Linear 
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10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cunninghamella bertholletiae 

(B) STRAIN: ATCC 42115 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 780 



TACTTGTAAA TGGTCTCAAG ATCGTTACAA CGAAATTGTT AAGGAAGTTT 50 

CTTCCTTCAT CAAGAAGATT GGTTACAATC CTAAATCCGT TCCTTTCGTY 100 

CCTATCTCTG GTTGGCACGG TGATAACATG TTGGAAGCTT CTACCAACAT 150 

GCCTTGGTAC AAGGGATGGA CCAAGGAAAC TAAAGCTGGT TCTTCCACTG 200 

15 GTAAGACTCT CTTAGAAGCC ATTGACAGCA TTGAACCTCC TACCCGTCCT 250 

TCTGACAAGC CTTTACGTCT TCCTTTACAA GATGTTTACA AGATTGGTGG 3 00 

TATTGGTACT GTCCCTGTTG GTCGTGTTGA AACTGGTGTC ATCAAGGCTG 350 

GTATGGTTGT TACYTTCGCT CCCGCTAACG TCACCACTGA AGTTAAGTCC 4 00 

GTTGAAATGC ATCACGAACA ATTAGAACAA GGTGTTCCTG GTGACAACGT 4 50 

2 0 TGGTTTCAAC GTCAAGAACG TTTCCGTTAA GGATATCCGT CGTGGTAACG 50 0 

TCTGTTCCGA CTCCAAGAAC GACCCCGCTA AGGAATCTGC TTCCTTCAAC 550 

GCTCAAGTTA TCGTCTTGAA CCACCCTGGT CAAATTGGTG CTGGTTATGC 60 0 

CCCAGTTCTT GACTGTCACA CTGCTCACAT TGCTTGTAAG TTCGCTGAAT 65 0 

TATTAGAAAA GATCGATCGT CGTTCCGGTA AGAAACTCGA AGATGCTCCT 700 

25 AAATTCGTTA AATCTGGTGA CTCTGCTATC GTTAAGATGG TTCCTTCCAA 750 

GC 752 



3 0 2) INFORMATION FOR SEQ ID NO : 781 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 8 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Curvularia lunata 

(B) STRAIN: ATCC 26425 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781 



CAAGTGGTCT GAGGACCGTT ACCAGGAAAT CATCAAGGAG ACCTCCAACT 50 

TCATCAAGAA GGTCGGCTAC AACCCCAAGC ACGTTCCCTT CGTCCCCATC 100 

TCCGGTTTCA ACGGAGACAA CATGATTGAG GCCTCCACCA ACTGCCCCTG 150 

GTACAAGGGT TGGGAGAAGG AGACCAAGGC CAAGGCCACT GGTAAGACCC 2 00 

50 TCCTTGAGGC CATCGACGCC ATCGACCCTC CTGTCCGTCC TACCGACAAG 2 50 

CCCCTCCGCC TTCCCCTCCA GGATGTTTAC AAGATTGGTG GTATTGGCAC 300 

GGTCCCCGTC GGTCGTGTCG AGACCGGTAT CATCAAGCCC GGTATGGTCG 350 

TCACCTTCGC CCCCGCTGGT GTCACCACCG AAGTCAAGTC CGTCGAGATG 400 

CACCACGAGC AGCTTACTGA GGGTGTCCCC GGTGACAACG TCGGCTTCAA 450 

55 CGTCAAGAAC GTCTCCGTCA AGGAGATCCG TCGTGGTAAC GTTGCCGGTG 500 

ACTCCAAGAA CGACCCCCCC AAGGGTTGCG AGTCCTTCAA CGCCCAGGTC 550 

ATCGTCCTCA ACCACCCTGG TCAGGTCGGT GCCGGTTACG CCCCAGTCCT 600 

TGACTGCCAC ACTGCCCACA TTGCCTGCAA GTTCTCCGAG CTCCTCGAGA 650 

AGATCGACCG CCGTACCGGA AAGTCTGTTG AGAACTCCCC CAAGTTCATC 700 

60 AAGTCCGGTG ACGCCGCCAT CGTCAAGA 72 8 

445 
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2) INFORMATION FOR SEQ ID NO: 782 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double - 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 782 

TGAAGTCCGA GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG 50 

TTCGAGACTC CCAAGTAAGG CTCAACAGAC ACAACAAGCA AATGCATACT 10 0 

CGCTAACCTA TTCACCCACC ACAGGTACAA CGTCACCGTC ATTGACGCCC 150 

CCGGTCACCG TGATTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 2 00 

GACTGCGCCA TTCTCATCAT TGCCGCCGGT ACTGGTGAGT TCGAGGCCGG 2 50 

TATCTCCAAG GACGGTCAGA CCCGTGAGCA CGCTCTTCTC GCCTACACCC 3 00 

TTGGTGTCAA GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAG 3 50 

TGGTCTGAGG CCCGTTACCA GGAGATCATC AAGGAGACCT CCGGTTTCAT 4 00 

CAAGAAGGTC GGCTTCAACC CCAAGCACGT TCCCTTCGTG CCCATCTCCG 4 50 

GTTTCAACGG TGACAACATG ATCGACGTCT CCACCAACTG CCCCTGGTAC 500 

AAGGGTTGGG AGAAGGAGAC CAAGGCCAAG GCCACCGGCA AGACCCTCCT 55 0 

CGAGGCCATT GACGCCATCG ACCCCCCCAC TCGTCCCACC GACAAGCCCC 60 0 

TCCGTCTTCC CCTYCAGGAT GTCTACAAGA TCGGTGGTAT TGGCACGGTT 650 

CCCGTCGGTC GTGTCGAGAC CGGTACCATC AAGGCCGGCA TGGTCGTCAC 70 0 

CTTCGCCCCC GCTGGTGTCA CCACTGAGGT CAAGTCCGTC GAGATGCACC 750 

ACGAGCAGCT TCCCGAGGGT CTCCCCGGTG ACAACGTCGG CTTCAACGTC 800 

AAGAACGTCT CCGTCAAGGA GATCCGTCGT GGCAACGTCG CCGGTGACTC 850 

CAAGAACGAC CCCCCCAAGG GCTGCGACAG CTTCAACGCC CAGGTCATCG 90 0 

TCCTCAACCA CCCCGGTCAG GTCGGCGCCG GCTACGCGCC CGTCCTCGAC 950 

TGCCACACTG CTCACATTGC TTGCAAGTTC TCTGAGCTCC TCGAGAAGAT 100 0 

CGACCGCCGT ACCGGCAAGT CCATTGAGGC CAGCCCCAAG TTCATCAAGT 1050 

CTGGTGACGC CGCCATCGTC AAGATGGTTC CCTCCAAGCC TATGTGCGTT 110 0 

GAGGCCTTCA CCGACTACCC CCCTCTTGGA CGTTTCGCCG TCCGT 114 5 



2) INFORMATION FOR SEQ ID NO: 783 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1151 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microspojrum audouinii 

(B) STRAIN: ATCC 11347 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 783 

446 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GCTCAAGGCT 
AGTTCGAGAC 
TTTCCCATTT 
5 GACACCGTGA 
TGCGCTATTC 
CTCCAAGGAT 
GTGTCAAGCA 
TCTGAGTCCC 

10 GAAGGTCGGC 
TCAACGGTGA 
GGATGGAACA 
CCTTGAGGCC 
CTCTCCGTCT 

15 GTACCAGTCG 
CACTTTCGCC 
ACCACCAGCA 
GTCAAGAACG 
TTCCAAGAAC 

2 0 TCGTCCTCAA 
GACTGCCACA 
GATTGACCGC 
AGTCTGGTGA 
GTTGAGGCTT 

25 C 



GAGCGTGAGC 
CCCCAAGTAC 
ATAGTTGCGA 
CTTCATCAAG 
TCATCATTGC 
GGCCAGACTC 
GCTCATCGTT 
GTTTCGGTGA 
TACGACCCCA 
CAACATGATT 
AGGAGACCAA 
ATCGATGCCA 
CCCACTCCAG 
GTCGTGTTGA 
CCCGCCAACG 
GCTCGTTCAG 
TCTCTGTCAA 
GACCCACCAT 
CCACCCCGGC 
CTGCCCACAT 
CGTACTGGTA 
TGCCGCTATT 
TCACTGACTA 



GTGGTATCAC 
ATGGTCACCG 
CAAGTAACTA 
AACATGATTA 
TGCCGGTACT 
GTGAGCACGC 
GCCATCAACA 
AATCATCAAG 
AGGGTGTCCC 
GAGCCCTCCA 
GGCCGGTGGC 
TTGACATGCC 
GATGTCTACA 
GACTGGTATC 
TCACCACTGA 
GGTGTTCCCG 
GGAAGTCCGC 
CTGGCTGCGC 
CAGATCGGTG 
TGCTTGCAAG 
AATCCGTCGA 
GCCACCATGG 
CCCACCACTT 



CATTGACATC 
TCATCGGTAT 
ATAAAAAGTA 
CTGGTACCTC 
GGTGAGTTCG 
CCTGCTCGCT 
AGATGGACAC 
GAAGTCACCA 
ATTCGTCCCA 
CCAACTGCCC 
AAATCCTCTG 
CACTCGTCCC 
AGATCTCTGG 
ATCAAGCCTG 
AGTCAAGTCC 
GTGACAACGT 
CGTGGTAACG 
CTCTTTCAAG 
CTGGTTACGC 
TTCTCTGAGC 
AACCAGCCCT 
TTCCATCCAA 
GGTCGTTTCG 



GCCCTCTGGA 5t) 

GCTTTATCTG 100 

GATGCCCCCG 150 

CCAGGCCGAC 2 00 

AGGCTGGTAT 250 

TTCACCCTCG 300 

CACCAACTGG 3 50 

ACTTCATCAA 4 00 

ATCTCTGGCT 450 

ATGGTACAAG 500 

GTAAGACCCT 550 

ACCGACAAGC 600 

TATCGGAACA 650 

GTATGGTTGT 700 

GTCGAAATGC 750 

TGGCTTCAAC 800 

TTGCCGGTGA 850 

GCCCAGGTCA 900 

CCCAGTCCTC 950 

TTCTTGAGAA 1000 

AAGTTCGTCA 1050 

GCCCATGTGC 1100 

CCGTCCGTGA 1150 
1151 



2) INFORMATION FOR SEQ ID NO: 784 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Mucor circinelloides 

(B) STRAIN: ATCC 38592 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 784 

45 CCAAGTGGTC TCAAGATCGT TACAACGAAA TTGTCAAGGA AGTTTCCGGT 50 

TTCATCAAGA AGATCGGTTT CAACCCCAAG TCCGTTCCTT TCGTTCCCAT 100 

TTCTGGCTGG CACGGTGATA ACATGTTGGA TGAATCCACC AACATGCCCT 150 

GGTTCAAGGG ATGGAACAAG GAGACCAAGG CCGGTTCCAA GACTGGTAAG 2 00 

ACTCTCCTCG AAGCCATCGA TGCCATTGAG CCCCCTGTCC GTCCTTCTGA 250 

50 CAAGCCTCTC CGTCTTCCTC TTCAAGATGT CTACAAGATT GGTGGTATTG 3 00 

GTACAGTTCC CGTCGGTCGT GTTGAAACTG GTACTATCAA GGCTGGTATG 3 50 

GTTGTCAACT TCGCTCCCGC TGCTGTCACC ACTGAAGTTA AGTCCGTTGA 4 00 

AATGCATCAC GAAACCCTCT CTGAAGGTCT CCCCGGTGAC AACGTTGGTT 4 50 

TCAACGTCAA GAACGTCTCC GTCAAGGATA TCCGTCGTGG TAACGTCTGT 500 

55 TCCGACTCCA AGAACGATCC CGCTAAGGAA TCTGCCTCTT TCACTGCTCA 550 

AGTTATTATC TTGAACCATC CCGGTCAAAT CTCTGCTGGT TACGCACCAG 600 

TTCTCGATTG TCACACTGCT CACATCGCCT GTAAGTTCTC TGAACTCATT 650 

GAGAAGATTG ATCGTCGTTC CGGTAAGTAC CTGCATCTGT CAGAATTGAA 700 

GGTCCGCCGT TATAGCAAAG GCTGGGTTTA AATGTTGGGG TTTGTCTGAT 750 

60 CTATAATGAT GATTGCTCCT TCAATTTTTG ACATAATTTG ATGATCTGAA 800 

447 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



TTGTGTTGCT AACGTCGCAT TTGCTTCTTT TGCTTCTTTT GCATGTAGGT 
AAGAAGATGG AAGATGCTCC CAAAGTAAGT ATTACGATTG ATGGACAATT 
AAAATAGAAT ACTAACAATT ATTGTTTATA GTTCGTCAAG TCTGGTGACT 
CTGCTATCGT CAAGATGGTT CCCTCCAAG 



85^ 
900 
950 
979 



2) INFORMATION FOR SEQ ID NO: 785 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 99 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Phialaphora verrucosa 

(B) STRAIN: ATCC 38561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785 

GGACAAGCTG AAGGCCGAAC GTGAGCGTGG TATCACCATC GATATCGCGC 50 

TCTGGAAGTT CGAGACTCCC AAATACTTCG TCACCGTCAT TGATGCCCCT 100 

GGTCATCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 150 

TTGTGCCATC CTCATCATTG CCGCCGGTAC CGGTGAGTTC GAAGCCGGTA 2 00 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTCCTCGC CTACACCCTA 2 50 

GGTGTGAAGC AGCTTATCGT CGCCATCAAC AAGATGGACA CCGCCAAATG 300 

GTC CGAGGAT CGGTTCAACG AAATCATCAA GGAGACTTCC AACTTCATCA 3 50 

AGAAGGTCGG ATACAACCCC AAGTCCGTCC CGTTCGTGCC CATCTCCGGT 4 00 

TTCAACGGTG ACAACATGAT CGACGTCTCC TCCAACGCCC CCTGGTACAA 450 

GGGTTGGGAG AAGGAGACCA AGGCCGGCAA GGCCACTGGC AAGACCCTCC 500 

TCGAGGCCAT CGACGCGATT GACCCTCCTA CTCGTCCCAC CGACAAGCCC 550 

CTCCGTCTCC CTCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGCACGGT 600 

GCCCGTTGGT CGTGTTGAGA CCGGTACCAT CAAAGCCGGT ATGGTCGTCA 650 

CCTTCGCTCC CGCCAACGTC AC C ACTGAAG TCAAGTCCGT CGAAATGCAC 700 

CACGAACAGC TCGCCGAAGG TGTTCCAGGT GACAATGTCG GCTTCAACGT 750 

CAAGAACGTC TCCGTCAAGG AGGTTCGTCG TGGAAACGTT GCCGGTGACT 800 

CCAAGAATGA CCCCCCCAAG GGTGCCGACT CCTTCAACGC TCAGGTCATC 850 

GTCCTCAACC ACCCTGGTCA GGTCGGTGCC GGCTACGCCC CGGTCTTGGA 90 0 

TTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTC CTCGAGAAGA 950 

TCGATCGTCG TACCGGCAAG TCCATGGAAA ACAACCCCAA GTTTATCAAG 1000 

TCTGGTGATG CTGCCATCGT GAAGATGGTT CCCAGCAAGC CTATGTGCGT 1050 

TGAGGCCTTC ACCGACTATC CTCCTCTTGG TCGTTTCGCC GTCCGTGAC 10 99 



2) INFORMATION FOR SEQ ID NO: 786 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saksenaea vasiformis 

448 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(B) STRAIN: ATCC 60625 



PCT/CAOO/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 786 

5 ACCACCAAGT GGTCTGAGGC CCGTTACCAG GAGATCATCA AGGAGACCTC 50 

CGGYTTCATC AAGAAGGTCG GCTTCAACCC CAAGCACGTT CCCTTCGTGC 100 

CCATCTCCGG TTTCAACGGT GACAACATGA TCGACGTCTC CACCAACTGC 150 

CCCTGGTACA AGGGTTGGGA GAAGGAGACC AAGGCCAAGG CCACCGGCAA 2 00 

GACCMTCCTC GAGGCCATTG ACGCCATCGA CCCCCCYAGY CGTCCCACCG 250 

10 ACAAGCCCCT YCGTCTTCCC CTMCAGGATG TYTACAAGAT TGGCGGTATT 3 00 

GGCACGGTTC CCGTCGGTCG TGTYGAGACC GGTRCCATCA AGGGTGGCAT 350 

GGTCGTCACC TTCCCCCCCG CTGGTGTCAC CACTGAGGTC AAGTCCGTCG 40 0 

AGATGCACCA CGAGCAGCTC GCCGAGGGTS TCCCCGGTGA CAACGTCGGC 4 50 

TTCAACGTCA AGAACGTCTC CGTCAAGGAG ATCCGTCGTG GCAACGTTGC 50 0 

15 CGGTGACTCC AAGAACGACC CCCCCAAGGG CTGCGACAGC TTCAACGCCC 550 

AGGTCATCGT CCTCAACCAC CCCGGTCAGG TCGGCGCCGG CTACGCGCCS 60 0 

GTCCTSGACT GCCACACTGC TCACATTGCC TGCAAGTTCT CTGAGCTCCT 650 

CGAGAAGATC GACCGCCGTT CCGGCAAGTC CATCGAGTCC GGCCCCAAGT 7 00 

TCATCAAGTC TGGTGACGCC GCCATCGTCA AGATGGTTCC CTCCAAGCCC 750 

20 



2) INFORMATION FOR SEQ ID NO: 787 



25 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1084 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Syncephalastrum racemosum 

(B) STRAIN: ATCC 32330 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 78 7 



40 



45 



50 



55 



60 



GCTCAAGGCC 
AGTTCGAGAC 
CGTGATTTCA 
TATCCTCATC 
AGGATGGCCA 
CGTCAGCTGA 
GGCCCGTTAC 
TCGGTTTCAA 
GGTGACAACA 
GAAGAAGGAG 
CCATTGACAA 
CTTCCCCTCC 
CGGTCGTGTC 
CCCCCGCCAA 
CAGCTCGTCG 
CGTTTCCGTC 
ACGACCCCGC 
AACCACCCCG 
CACCGCTCAC 
GTCGTTCCGG 
GACTCCGCCA 
CTACACTGAG 



GAGCGTGAGC 
CCCCAAGTAC 
TCAAGAACAT 
ATTGCCGCCG 
GACCCGTGAG 
TCGTCGCCAT 
AACGAAATCG 
CCCCAAGTCC 
TGTTGGAGGA 
ACCAAGGCTG 
CATTGACCCC 
AGGATGTCTA 
GAGACTGGTG 
CGTCACCACT 
AGGGTGTCCC 
AAGGATATCC 
CAAGGAGTCT 
GTCAGATCGG 
ATTGCCTGCA 
YAAGAAGCTC 
TCGTCAAGAT 
TACCCCCCTC 



GTGGTATCAC 
CACGTCACCG 
GATCACTGGT 
GTACTGGTGA 
CACGCTCTGC 
CAACAAGATG 
TCAAGGAGGT 
GTTCCCTTCG 
GTCCTCCAAC 
GCGAGAAGTC 
CCGGTCCGTC 
CAAGATCGGT 
TCATCAAGGC 
GAAGTCAAGT 
CGGTGACAAC 
GCCGTGGTAA 
GCCTCGTTCA 
TGCCGGTTAC 
AGTTCGCTGA 
GAAGAGTCCC 
GGTTCCCTCC 
TTGGCCGTTT 



CATCGATATC 
TCATTGATGC 
ACCTCGCAGG 
GTTCGAGGCT 
TTGCCTTCAC 
GACTCGACCA 
CTCCACCTTC 
TCCCCATCTC 
ATGCCCTGGT 
CGGCAAGACC 
CCTCGGACAA 
GGTATCGGCA 
TGGTATGGTC 
CCGTCGAGAT 
GTCGGTTTCA 
CGTCTGCTCT 
CCGCCCAGGT 
GCCCCGGTTC 
GCTCCTCGAG 
CCAAGTTCGT 
AAGCCCATGT 
CGCC 



GCTCTCTGGA 50 

CCCCGGCCAT 100 

CTGACTGCGG 150 

GGTATCTCCA 2 00 

CCTCGGTGTC 250 

AGTACTCTGA 300 

ATCAAGAAGA 3 50 

TGGCTGGAAC 4 00 

TCAAGGGCTG 4 50 

CTCCTTGAGG 50 0 

GCCCCTCCGT 550 

CAGTCCCCGT 600 

GTGACCTTCG 650 

GCACCACGAG 700 

ACGTCAAGAA 750 

GACTCCAAGA 8 00 

CATCGTCCTG 850 

TTGACTGCCA 900 

AAGATCGACC 950 

CAAGTCGGGT 1000 

GCGTTGAGGC 1050 
1084 
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2) INFORMATION FOR SEQ ID NO: 788 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1155 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 788 

GCTCAAGGCC GAGCGTGAGC GTGGTATCAC CATCGATATC GCCCTCTGGA 50 

AGTTCGAGAC CCCCAAGTAC AATGTCACCG TCATTGGTAT GTTTTTCTTT 100 

ACCTTTCCCC TCCATCGTCT TGCTGTGCCA TAACTAACGA GAGTAGACGC 150 

CCCCGGTCAC CGTGACTTCA TCAAGAACAT GATCACTGGT ACCTCCCAGG 200 

CTGACTGTGC TATTCTCATC ATTGCTGCCG GTACTGGTGA GTTCGAGGCT 2 50 

GGTATCTCCA AGGATGGCCA GACCCGTGAG CACGCTCTGC TCGCCTTCAC 3 00 

CCTTGGTGTC AAGCAGCTCA TCGTTGCCAT CAACAAGATG GACACCACCA 350 

ACTGGTCCGA GGACCGTTTC AAGGAAATCA TCAAGGAAGT CACCAACTTC 4 00 

ATCAAGAAGG TTGGCTACGA CCCCAAGGGT GTTCCATTCG TTCCAATCTC 450 

TGGTTTCAAC GGTGACAACA TGATTGAGGC CTCCAGCAAC TGCCCATGGT 500 

ACAAGGGATG GAACAAGGAG ACCAAGGCCG GTGGTGCCAA GACTGGCAAG 550 

ACCCTYCTCG AGGCCATCGA TGCCATCGAC ATGCCAACCC GTCCTACCGA 600 

CAAGCCCCTY CGTCTCCCAC TCCAGGATGT CTACAAGATC TCTGGTATCG 650 

GAACTGTACC AGTCGGTCGT GTTGAGACCG GTATCATCAA GCCTGGTATG 7 00 

GTCGTCACCT TCGCCCCTGC CAACGTCACC ACTGAAGTCA AGTCCGTCGA 750 

AATGCACCAC CAGCAGCTTC AGCAGGGTGT CCCCGGTGAC AACGTCGGCT 800 

TCAACGTCAA GAACGTTTCC GTCAAGGAAG TCCGCCGTGG TAACGTTGCC 850 

GGTGACTCCA AGAACGACCC ACCATCCGGC TGTGCCTCCT TCAACGCCCA 900 

GGTCATCGTC CTCAACCACC CCGGCCAGAT CGGTGCTGGT TACGCCCCAG 950 

TCCTCGACTG CCACACTGCT CACATTGCTT GCAAGTTCGC TGAGCTCCTC 1000 

GAGAAGATTG ACCGCCGTAC CGGTAAATCC GTCGAAGCCA ACCCCAAGTT 1050 

CGTCAAGTCT GGTGATGCCG CTATCGCCAA GATGGTTCCC TCCAAGCCCA 110 0 

TGTGCGTTGA GGCTTTCACT GACTACCCCC CACTTGGTCG TTTCGCCGTC 1150 

CGTGA 1155 



2) INFORMATION FOR SEQ ID NO: 789 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 9 

TCAAGGCCGA GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG 50 

TTCGAGACCC CCAAGTACAA TGTCACCGTC ATTGGTATGT TTCTCTTTAC 100 

5 CTTTCCCCTC CATCGTCTTG CTGTGCCATA ACTAACGAGA GTAGACGCCC 150 

CCGGTCACCG TGACTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCT 2 00 

GACTGTGCTA TTCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG 2 50 

TATCTCCAAG GATGGCCAGA CCCGTGAGCA CGCTCTGCTC GCCTTCACCC 3 00 

TTGGTGTCAA GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAC 3 50 

10 TGGTCCGAGG ACCGTTTCAA GGAAATCATC AAGGAAGTCA CCAACTTCAT 400 

CAAGAAGGTT GGCTACGACC CCAAGGGTGT TCCATTCGTT CCAATCTCTG 450 

GTTTCAACGG TGACAACATG ATTGAGGCCT CCACCAACTG CCCATGGTAC 500 

AAGGGATGGA ACAAGGAGAC CAAGGCCGGT GGTGCCAAGA CTGGCAAGAC 550 

CCTCCTCGAG GCCATCGATG CCATCGACAT GCCAACCCGT CCTACCGACA 600 

15 AGCCCCTCCG TCTCCCACTC CAGGATGTCT ACAAGATCTC TGGTATCGGA 650 

ACTGTACCAG TCGGTCGTGT TGAGACCGGT ATCATCAAGC CCGGTATGGT 700 

CGTCACCTTC GCCCCTGCCA ACGTCACCAC TGAAGTCAAG TCCGTCGAAA 750 

TGCACCACCA GCAGCTTCAG CAGGGTGTCC CCGGTGACAA CGTCGGCTTC 8 00 

AACGTCAAGA ACGTTTCCGT CAAGGAAGTC CGCCGTGGTA ACGTTGCCGG 850 

2 0 TGACTCCAAG AACGACCCAC CATCCGGCTG TGCCTCCTTC AACGCCCAGG 900 

TCATCGTCCT CAACCACCCC GGCCAGATCG GTGCTGGTTA CGCCCCAGTC 950 

CTCGACTGCC ACACTGCTCA CATTGCTTGC AAGTTCGCTG AGCTCCTCGA 1000 

GAAGATTGAC CGCCGTACCG GTAAATCCGT CGAAGCCAAC CCCAAGTTCG 1050 

TCAAGTCTGG TGATGCCGCT ATCGCCAAGA TGGTTCCATC CAAGCCTATG 1100 

2 5 TGCGTTGAGG CTTTCACTGA CTACCCCCCA CTTGGTCG 1138 



30 



2) INFORMATION FOR SEQ ID NO: 790 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 74 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
4 0 (A) ORGANISM: Bipolaris hawaiiensis 

(B) STRAIN: ATCC 26067 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790 

4 5 CACCACCAAG TGGTCTGAGG AGCGTTACCA GGAAATCATC AAGGAGACCT 50 

CCAACTTCAT CAAGAAGGTC GGCTACAACC CCAAGCACGT TCCCTTCGTC 100 

CCCATCTCTG GTTTCAACGG AGACAACATG ATTGAGGCTT CCACCAACTG 150 

CCCCTGGTAC AAGGGTTGGG AGAAGGAGAC CAAGGCCAAG GCCACTGGTA 2 00 

AGACCCTTCT CGAGGCCATC GACGCCATCG ACCCCCCTGT CCGTCCTACC 2 50 

50 GACAAGCCCC TCCGCCTTCC CCTCCAGGAT GTGTACAAGA TTGGTGGTAT 3 00 

TGGCACGGTT CCCGTCGGTC GTGTCGAGAC CGGTATCATC AAGCCCGGTA 3 50 

TGGTCGTCAC CTTCGCCCCC GCTGGTGTCA CCACTGAAGT CAAGTCCGTC 4 00 

GAGATGCACC ACGAGCAGCT TGCCGAGGGT GTCCCCGGTG ACAACGTCGG 450 

CTTCAACGTC AAGAACGTCT CCGTCAAGGA GATCCGTCGT GGTAACGTTG 500 

55 CCGGTGACTC CAAGAACGAC CCCCCCAAGG GTTGCGAGTC CTTCAACGCC 550 

CAGGTCATCG TCCTCAACCA CCCCGGTCAG GTCGGTGCCG GTTACGCACC 600 

AGTCCTTGAC TGCCACACTG CCCACATTGC TTGCAAGTTC TCCGAGCTCC 650 

TCGAGAAGAT TGACCGCCGT ACCGGAAAGT CTGTTGAGAA CTCCCCCAAG 700 

TTCATCAAGT CCGGTGACGC CGCCATCGTC AAGATGGTTC CCTCCAAG 748 



60 
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10 



40 



55 



2) INFORMATION FOR SEQ ID NO: 791 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 958 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
15 (B) STRAIN: ATCC 14110 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791 

CGCTATTGTC GTTGTTGCTG CCTCCGACGG TCAGATGTAG GTGGAACATC 50 

2 0 TTGGGAAATA CGTCGTAAAA CACGGCGCTT ACGTTTTCGC GAATAGGCCC 100 

CAGACTCGTG AGCATTTGCT GCTCGCCCGC CAGGTTGGTG T C C AG AAG AT 150 

CGTTGTCTTC GTCAACAAAA TCGATGCTAT TGATGATCCG GAGATGCTGG 2 00 

AACTGGTCGA ACTCGAGATG CGTGAGCTGC TGAACAGCTA CGGTTTCGAG 2 50 

GGTGAAGAGA CTCCGATCAT TTTCGGTTCC GCTCTCTGTG CTCTCGAAGG 3 00 

2 5 ACGCCGTGAC GACATCGGTA AAGACAGAAT TGAGCAGCTT ATGAACGCTG 3 50 

TCGACACCTG GATCCCCACT CCTCAGCGTG ACCTCGACAA ACCTTTCTTG 400 

ATGTCTGTCG AGGAAGTGTT CTCTATCGCC GGCCGTGGTA CCGTGGCTTC 4 50 

TGGTCGTGTC GAGCGTGGTA TCTTGAAGAA GGACTCTGAG GTTGAGATTG 500 

TTGGAGGCTC CTTCGAACCC AAGAAGACCA AAGTCACCGA CATTGAAACC 550 

30 TTCAAGAAGA GCTGTGATGA ATCGCGTGCT GGTGACAACT CTGGTCTCCT 6 00 

CCTGCGTGGT ATCCGACGTG AAGACGTCAA GCGTGGTATG GTCATTGCTG 650 

TTCCCGGCAG CACCAAAGCT CACGACAAGT TCCTCGTCTC CATGTACGTC 700 

CTGACCGAGG CGGAGGGTGG TCGTCGTACT GGCTTCGGTG CCAACTACCG 7 50 

TCCCCAAGTC TTCATCCGTA CTGCAGGTAA GTTCCCGCAC ACCGTGTCCA 8 00 

35 GATCTTCCGA GAGATTAGCG ATATATGCTA ATGATTCATC AGACGAGGCT 850 

GCTGACCTCA GCTTCCCTGA CGGCGACCAA TCTCGCAGAG TTATGCCTGG 900 

TGACAACGTC GAGATGATCC TGAAGACCCA CCACCCTGTT GCTGCTGAGG 950 

CTGGTCAA 9 58 



2) INFORMATION FOR SEQ ID NO: 792 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 93 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton mentagrophytes 

(B) STRAIN: ATCC 8125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 92 



CGTTGTCGCA GCTTCTGACG GTCAAATGTA ATTGAATGCC CGCCCAGACG 50 

GATGAAAGGA TTTGACGTTT CTAACATCAT TCTAGGCCTC AGACCAGAGA 100 

60 ACATTTGCTC CTTGCCCGCC AGGTCGGTGT CCAGAAGCTG GTCGTTTTCG 150 
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TTAACAAGGT CGATGCCGTT 
CTTGAAATGC GTGAACTCCT 
CCCCATCATT TTTGGCTCTG 
AGCTTGGTGT CGAGAAGATT 
5 ATCCCCACCC CCGAGCGCGC 
GGAAGTGTTC TCTATCTCTG 
AGCGTGGTAT CCTCAAGAAG 
ACCACCCCTA TCAAGACCAA 
CTGCGATGAA TCTCGAGCTG 

10 TCAAGCGTGA GGACTTGAAG 
ACCAAGGCTC ACACCGACTT 
TGAGGGTGGT CGTTCCAACG 
TCATCCGTAC TGCTGGTATG 
ATCATTGCTA ACTTGTATTT 

15 TGGCCTGGAG AAGACCAAGA 
GATGATTTGC AAGACCCTCC 



GAGGACCCAG AGATGTTGGA 
CAGCCACTAC GGTTTCGAGG 
CTCTCTGTGC CCTCGAGTCC 
GACGAGCTAT TGAACGCCGT 
CACTGATAAG CCTTTCCTCA 
GTCGTGGTAC CGTCGTCTCC 
GATTCCGACG TCGAAATTGT 
GGTCACAGAT ATCGAAACCT 
GTGACAACTC TGGTCTCCTT 
CGTGGAATGG TTGTTGCTGC 
CATGGTCTCC CTCTACGTCC 
GCTTCACCCA CAAGTACCGC 
TAACCCAAGT TTCCGCTATT 
CCTTCCGTAG ACGAAGCCGC 
CAAGAAGGCT ATGCCTGGTG 
ACCCCATTGC TGCCGA 



PCT/CA00/01150 

GCTTGTCGAA '20 Of 

GTGAGGAGAC 2 50 

CGTCGACCTG 3 00 

CGACACCTGG 350 

TGTCCATTGA 4 00 

GGTCGTGTTG 4 50 

TGGTGGCTCT 50 0 

TCAAGAAGTC 550 

CTCCGAGGTA 60 0 

CCCCGGATCC 650 

TGACTGAGGC 700 

CCCCAAATGT 750 

TACTAAGTAG 800 

ATCTTTCAGC 850 

ACAACGTCGA 900 

936 



20 .2) INFORMATION FOR SEQ ID NO: 793 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 793 

TTATTGTTGC TGCTGGTACT 2 0 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 794 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma, cruzi 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 794 

GACGACAAGT CGGTGAACTT 2 0 



55 2) INFORMATION FOR SEQ ID NO: 795 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Trypanosoma cruzi 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795 

ACTTGCACGC GATGTGGCAG 



2 ) INFORMATION FOR SEQ ID NO: 7 96 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796 
GGTCCAATGC CWCAAACWAG A 



2) INFORMATION FOR SEQ ID NO: 797 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 797 
CATTAAGAAT GGYTTATCTG TSKCTCT 



2 ) INFORMATION FOR SEQ ID NO: 798 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798 
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TGGTTGTCCC AGCCGATCGT TT 22 



5 2) INFORMATION FOR SEQ ID NO: 799 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799 

2 0 ACCTGTGAAT ACAAGCAATC T 21 



25 



40 



50 



55 



60 



2 ) INFORMATION FOR SEQ ID NO: 800 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: single 
3 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 80 0 

GATGAAATCT TCAACGAAGT TGAT 24 

2 ) INFORMATION FOR SEQ ID NO: 801 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 801 

ACAACACCGA GAAGATCCCA 2 0 
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2) INFORMATION FOR SEQ ID NO: 802 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 02 

TTGCCATTTC TGGTTTCGTT 



2) INFORMATION FOR SEQ ID NO: 803 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 803 

ACTTCAGTGG TAACACCAGC 

2) INFORMATION FOR SEQ ID NO: 804 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 04 

CCTGGGACGG CCTCTGGCAT 



2) INFORMATION FOR SEQ ID NO: 805 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 805 

CTCTTGTCCA TCTTAGCAGT 20 



2) INFORMATION FOR SEQ ID NO: 806 

(i) SEQUENCE CHARACTERISTICS: 

2 0 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 806 

AGCATCACCA GACTTGATAA G 21 



2) INFORMATION FOR SEQ ID NO: 807 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 807 

AAAGTGGCTT CAAAGGTTGC 20 



55 2) INFORMATION FOR SEQ ID NO: 808 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812 
GTIACIGGIT CYTYRARRTT ICCICC 



2) INFORMATION FOR SEQ ID NO: 813 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813 
AATCYGTYGA AATGCAYCAC GA 



2 ) INFORMATION FOR SEQ ID NO: 814 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 814 
GC I GGCACGT ACAC I GCCTG 



2 ) INFORMATION FOR SEQ ID NO: 815 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 815 
TGGTGCATYT CKACRGACTT 
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2) INFORMATION FOR SEQ ID NO : 816 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 816 
GCTACGACGA GATCAAGGGC 20 



2) INFORMATION FOR SEQ ID NO: 817 



(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

2 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817 
TGGAAGAAGG CCGAGGAGTT 2 0 

30 

2) INFORMATION FOR SEQ ID NO: 818 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 818 
45 AGCCGGGCTG GATCTTCTTC 20 



2 ) INFORMATION FOR SEQ ID NO: 819 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819 

60 
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20 



5 2) INFORMATION FOR SEQ ID NO: 82 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma, brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 0 

2 0 GAAGGAGGTG TCTGCTTACA C 21 



2) INFORMATION FOR SEQ ID NO: 821 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Trypanosoma, brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 821 

GGCGCAAACG TCACCACATC A 2 1 

2) INFORMATION FOR SEQ ID NO: 822 

4 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 
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50 
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(ii) MOLECULE TYPE: DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 822 

CGGCGGATGT CCTTAACAGA A 21 
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2) INFORMATION FOR SEQ ID NO: 823 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 823 
GAGCGGTATG AYGAGATTGT 



2) INFORMATION FOR SEQ ID NO: 824 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 824 
GGCTTCTGCG GCACCATGCG 



2) INFORMATION FOR SEQ ID NO: 825 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 825 
ATGAGCARCG SAACCATCGT TCAGTG 



2) INFORMATION FOR SEQ ID NO: 826 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 826 
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TCGATCGTGC CGACCATGTA GAACGC 2 6 



2) INFORMATION FOR SEQ ID NO: 827 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 44 6 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium novyi 

(B) STRAIN: ATCC 19402 

2 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 7 

CACCAACTTG CTAAATGGGG AGATGCCCAG ATTGTTGTAT ATATAGGCTG 50 

TGGAGAACGT GGAAATGAAA TGACAGATGT TCTTAATGAG TTTCCAGAAC 100 

TTAAAGATCC TAAGACTGGC AAATCAATAA TGGAAAGAAC AGTTTTAATA 150 

2 5 GCAAATACTT CTAATATGCC AGTTGCAGCC CGTGAAGCTT GTATATATAC 2 00 

AGGAATCACA ATAGCAGAAT ATTTTAGAGA TATGGGATAT TCAGTAGCAC 2 50 

TTATGGCGGA TTCCACTTCA CGTTGGGCAG AGGCATTAAG AGAAATGTCT 3 00 

GGAAGACTTG AAGAAATGCC TGGTGATGAA GGTTACCCAG CTTATTTAGG 3 50 

ATCAAGACTT GCTGATTTCT ATGAAAGAGC TGGAAAAGTT GTGTGTTTAG 4 00 

3 0 GAGACGATGA AAGAGAAGGT GCCATTACTG CAATAGGTGC TGTATC 446 



35 



2 ) INFORMATION FOR SEQ ID NO: 828 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
4 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Clostridium difficile 

(B) STRAIN: 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 8 

50 CAGCATCAGC TTGCTAAATG GGCAGATGCA GATATAGTTG TATATATAGG 50 

CTGTGGCGAG CGTGGAAATG AAATGACAGA TGTTCTTCTT GAATTTCCTG 100 

AATTAAAAGA CCCAAGAACA GGCGAGTCAC TTATGCAAAG AACTGTGCTT 150 

ATAGCAAATA CATCAGATAT GCCGGTTGCT GCACGTGAAG CTTCTATATA 2 00 

CACTGGTATT ACAATAGCTG AATATTTTAG AGATATGGGA TATAGTGTTG 250 

55 CACTTATGGC AGACTCTACA TCAAGATGGG CTGAGGCTCT TAGAGAGATG 3 00 

AGTGGTCGTT TAGAGGAGAT GCCTGGTGAA GAAGGTTATC CTGCATACTT 3 50 

AGGTTCACGT CTTGCTCAAT TCTATGAGAG AGCAGGAAAG GTAAATTGTC 4 00 

TAGGTATGGA TGAAAGAGAA GGAACACTTA CAGCAATTGG TGCAG 445 



60 
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2) INFORMATION FOR SEQ ID NO: 82 9 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium septicum 

(B) STRAIN: ATCC 12464 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82 9 



ATGCTATAGC TAAATGGGGA GACAGCGAAA TAGTTGTTTA CGTTGGATGT 5 0 

GGAGAACGTG GTAACGAAAT GACAGACGTT CTTAACGAAT TCCCAGAACT 100 

2 0 TATTGACCCA AAAACTGGGG AAAGTTTAAT GAAGAGAACA GTACTTATAG 150 

CTAATACTTC AAACATGCCA GTTGCTGCTA GAGAAGCTTG CATATACACA 2 00 

GGTATTACAA TAGCTGAATA CTTCAGAGAT ATGGGATACT CAGTATCTAT 2 50 

AATGGCTGAT TCAACTTCAA GATGGGCAGA AGCATTAAGA GAAATGTCAG 3 00 

GTAGACTTGA AGAAATGCCA GGTGATGAAG GATATCCAGC GTACTTAGGA 3 50 

25 TCAAGACTTG CTGATTATTA CGAAAGAGCA GGTAAGGTTG TTTGTCTAGG 400 

TAAAGATGGT AGAGAAGGTG CTGTAACAGC AATTGGAGCT GTATC 44 5 



30 2) INFORMATION FOR SEQ ID NO: 830 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 

(B) STRAIN: 20:3.1 



45 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 830 



TCAAATTGCT AAATGGGGAG ATGCAGAAAT CGTTGTTTAC GTTGGATGCG 50 

GAGAACGTGG TAACGAAATG ACAGACGTTG TTAATGAGTT CCCAGAACTT 10 0 

ATTGACCCTA AGACTGGCGA AAGCTTAATG AAGAGAACAG TTCTTATAGC 150 

TAATACTTCA AACATGCCAG TTGCAGCGAG AGAAGCTTCA ATATATACAG 2 00 

50 GTATCACAAT AGCTGAATAT TTCAGAGATA TGGGATATGC AGTATCAATA 2 50 

ATGGCTGACT CAACTTCAAG ATGGGCTGAG GCATTAAGAG AAATGTCTGG 300 

TAGACTTGAA GAAATGCCTG GTGATGAAGG ATATCCAGCT TACCTTGGAT 3 50 

CAAGACTTGC TGATTACTAT GAAAGAGCTG GTAAGGTTGA ATGTTTAGGT 400 

AATGATGGAA GAATTGGTTC TATAACAGCA ATCGGTGCGG TATC 444 



2) INFORMATION FOR SEQ ID NO: 831 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 4 56 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: ATCC 13124 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 831 



TAGTTCAGCA CCAAGTTGCT AAATGGGGAG ATACTGAGAT AGTTGTTTAC 50 

GTTGGATGTG GAGAACGTGG TAACGAGATG ACAGACGTTC TTAACGAATT 10 0 

15 CCCAGAACTT AAAGACCCTA AAACTGGGGA AAGCTTAATG AAGAGAACAG 150 

TTCTTATTGC TAATACATCT AACATGCCAG TTGCTGCCAG AGAAGCATCA 2 00 

ATATATACTG GTATAACAAT AGCAGAGTAT TTCAGAGATA TGGGATACTC 2 50 

AGTATCAATC ATGGCTGACT CAACTTCACG TTGGGCAGAG GCTTTAAGAG 3 00 

AAATGTCAGG AAGACTTGAA GAAATGCCAG GAGACGAAGG TTACCCAGCA 3 50 

2 0 TACTTAGGAT CAAGACTTGC TGATTACTAT GAAAGAGCTG GTAAAGTTGT 4 00 

AGCTTTAGGT AAAGATGGAA GAGAAGGAGC TGTTACAGCT ATCGGAGCAG 4 50 

TATCCC 4 5 6 



25 2) INFORMATION FOR SEQ ID NO: 832 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium tetani 

(B) STRAIN: ATCC 194 06 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832 



CCAACTTGCA AAATGGGCTG ATGCTCAAAT AGTTGTGTAC ATAGGATGTG 50 

GAGAACGTGG AAATGAAATG ACAGACGTTT TAAATGAGTT CCCAGAATTA 100 

AAGGATC CTA AAACCGGGGA ATCTTTAATG AAAAGAACTG TGTTAATAGC 150 

AAATACATCT AATATGCCTG TTGCAGCTAG AGAAGCATCT ATATATACTG 200 

4 5 GTATAACAAT AGGGGAATAT TTTAGAGATA TGGGATATTC AATAGCACTA 25 0 

ATGGCAGATT CGACTTCTAG ATGGGCAGAG GCTCTAAGAG AAATGTCTGG 3 00 

AAGACTAGAG GAGATGCCAG GTGAAGAAGG TTATCCAGCT TATTTAGGAT 350 

CTAGATTAGC AGAGTTCTAT GAAAGAGCAG GTAATGTTAT ATGTTTAGGT 400 

CAGGATGGAA GAGAAGGAGC ATTAACAGCT ATAGGAGCAG TTTC 444 

50 

2) INFORMATION FOR SEQ ID NO: 833 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 178 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 
5 (C) ACCESSION NUMBER: Genome project 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 833 

TGAACCAAGG AAAAATAATA ACCGTTTCGG GACCTCTTGT TGTGGCTTCT 5 0 

10 GGGATGCAAG AAGCTAATAT TCAAGATATT TGTCGTGTGG GACATCTTGG 100 

CTTAGTCGGA GAAATTATTG AAATGCGTCG CGATCAAGCG TCTATTCAGG 150 

TTTATGAGGA AACATCAGGG ATCGGTCCAG GAGAACCAGT AGTGACTACT 2 00 

GGTTGTCCTT TGTCGGTCGA GTTAGGCCCG GGCCTGATTT CAGAAATGTT 2 50 

TGACGGTATT CAGCGACCGC TTGATCGTTT TCAAAAAGCA ACGGACAGCG 3 00 

15 ACTTTTTAAT CCGTGGTGTG GCTATCCCAA GTCTTGATCG AAAGGCTAAG 350 

TGGGCATTTA TTCCCAAGCT AAGTGTTGGT CAAGAAGTAG TTGCAGGTGA 4 00 

TATTTTAGGA ACTGTGCAAG AAACAGCTGT CATTGAGCAC CGTATCATGG 4 50 

TTCCTTATAA AGTTTCAGGG ACCTTGGTGG CTATTCATGC AGGGGACTTC 500 

ACAGTAACAG ATACAGTTTA TGAAATTAAG CAGGAAGACG GTTCCATTTA 550 

2 0 CCAAGGTAGC CTCATGCAGA CTTGGCCAGT TCGTCAAAGT CGCCCTGTTG 60 0 

CTCAAAAGCT TATCCCAGTC GAACCTTTGG TTACAGGTCA ACGGGTTATT 650 

GACACCTTTT TCCCTGTTAC AAAAGGTGGT GCCGCTGCCG TTCCTGGACC 7 00 

ATTTGGGGCA GGAAAAACAG TTGTGCAGCA TCAAATAGCT AAATTTGCCA 750 

ACGTTGATAT TGTTATTTAT GTCGGTTGTG GGGAACGCGG CAACGAGATG 800 

25 ACCGACGTTT TGAATGAGTT TCCAGAGTTA ATTGACCCAA ATACAGGCCA 850 

GTCCATTATG GAGCGCACGG TGTTAATTGC AAACACCTCT AATATGCCAG 900 

TAGCAGCGCG TGAAGCGTCG ATTTACACAG GTATTACCAT TGC CGAAT AT 950 

TTCCGTGATA TGGGCTATTC TGTGGCTATC ATGGCAGACT CGACATCACG 1000 

TTGGGCAGAA GCTCTGCGCG AGATGTCAGG ACGCCTACAA GAAATGCCTG 1050 

3 0 GTGATGAAGG CTACCCGGCT TACTTAGGGA GTCGTATTGC CGAATATTAT 1100 

GAACGGGCTG GTCGTGTTCG GACCTTGGGA AGTCAAGAAC GTGAGGGAAC 1150 
CATTACAGCC ATCGGCGCGG TTTCTCCTCC TGGAGGGGAT ATTTCAGAGC . 12 00 

CTGTCACTCA AAACACCCTT CGGATTGTCA AAGTTTTCTG GGGGCTCGAC 12 50 

GCGCCTCTTG CGCAACGGCG TCACTTCCCA GCGATTAACT GGCTGACGTC 13 0 0 

35 TTATTCATTG TATCAAGATG ATGTAGGAAG CTATATTGAC CGTAAACAGC 13 5 0 

AATCTAATTG GTCCAACAAG GTAACTCGTG CCATGGCTAT TTTGCAGCGT 1400 

GAAGCCAGTC TAGAAGAAAT TGTACGCTTG GTGGGGCTTG ATTCACTGTC 14 50 

TGAACAAGAT CGTTTGACCA TGGCTGTTGC CCGGCAAATT CGGGAGGATT 1500 

ATCTCCAGCA AAATGCCTTT GATTCGGTGG ATACCTTTAC TTCCTTTCCG 1550 

40 AAACAAGAGG CCATGCTAAC CAATATTTTG ACCTTTAATG AGGAAGCCAG 160 0 

CAAAGCCCTT TCTTTGGGAG CTTATTTTAA TGAGATTATG GAAGGCACTG 1650 

CTCAGGTACG CGATCGCATC GCACGCAGCA AATTTATCCC AGAAGAAAAC 170 0 

TTAGAGCAGA TTAAAGGGCT TACTCAGAAG GTTACCAAAG AGATTCACCA 1750 

CGTTTTAGCA AAGGGAGGAA TTTAGATGAG CGTTCT 17 86 

45 



2) INFORMATION FOR SEQ ID NO: 834 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 99 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia, bovis 
60 (B) STRAIN: Suarez-3 

466 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 834 



10 



TATCTCACGT 
CTAAGTACGC 
GGAAACGAGA 
GGTTGATGGC 
ATACTTCAAA 
ATTACCCTAT 
GGCGGATTCC 
GTTTAGCTGA 
AGGCTTTCGG 
ACCACTTCGT 



AAGTTTTTGC 
TAATACTGAC 
TTGCGGAGGT 
AAGGAAGTGA 
CATGCCAGTG 
GTGAATACTT 
ACCAGTCGTT 
GATGCCTGCT 
CGTTCTATGA 
GAAGGTTCAG 



GCGGCGGTTA 
GTTACTGTCT 
GCTTAAGGAG 
GCATTATGAA 
GCCGCCAGGG 
CAGGGATATG 
GGGCTGAGGC 
GATTCAGGTT 
GCGTGCTGGT 
TTACCATTGT 



TATATCACTC 
ATGTGGGATG 
TTCCCTGAGC 
ACGCACTTGC 
AGGCTAGTAT 
GGATACAACG 
TTTGCGTGAG 
ATCCCGCCTA 
ACAGCTGAGT 
TGGTGCTGTA 



CAGGCCCTGG 50 

TGGAGAGCGT 100 

TGAAGACCAA 150 

TTGGTGGCCA 2 00 

CTACACTGGC 2 50 

CCTGTGTGAT 3 00 

ATATCAGGTC 3 50 

CCTTGCTTCT 4 00 

GTATTGGAAC 4 50 

TCTCCACCA 4 99 



15 



20 



25 



2) INFORMATION FOR SEQ ID NO: 835 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 64 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 



30 



35 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 83 5 



TTTCTCAAGC 
GGTTGTGGAG 
TGAGCTTTAT 
CTTGTTTAGT 
TCCATCTACA 
TAATGTTTCT 
GAGAAATTTC 
GCATATTTAG 
TAAATGTATG 
CAGTTTCTCC 



TTTGAGTAAA 
AAAGAGGAAA 
ACTATGGTTG 
AGCTAATACA 
CTGGTATTAC 
ATGATGGCAG 
TGGTAGATTA 
GCGCCAGACT 
GGTTCCCCAG 
ACCT 



TATAGTAACT 
TGAAATGGCA 
ATGGAAAGAA 
TCAAATATGC 
ACTTTCTGAA 
ATTCAACTTC 
GCTGAAATGC 
TGCTTCATTC 
ATAGAGAAGG 



CTGATGTTAT 
GAAGTTCTTA 
GGAGTCAATT 
CTGTCGCTGC 
TACTTTAGAG 
TCGTTGGGCT 
CTGCAGATTC 
TATGAAAGAT 
TACAGTAACA 



TATTTACATT 50 

CAGAATTCCC 100 

ATGCAAAGAA 150 

TAGAGAAGCT 2 00 

ATATGGGATG 2 50 

GAAGCTCTTA 3 00 

GGGTTACCCA 3 50 

CAGGAAGAGT 4 00 

ATTGTTGGTG 4 50 

464 



4 5 2) INFORMATION FOR SEQ ID NO: 83 6 
3) 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 446 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 83 6 

467 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

GCCCTCTCCA AGTACTCCAA CTCCGATTGC GTCATCTATG TC&GCTGCGG ' 50 

CGAGCGCGGT AATGAGATGG CCGAGGTGCT CATGGAGTTC CCGACCCTGA 100 

CGACCGTGAT CGATGGCCGC GAGGAGTCGA TCATGAAGCG CACCTGCCTC 150 

GTGGCGAACA CCTCGAACAT GCCAGTCGCA GCCCGTGAGG CCTCTATTTA 2 00 

CACCGGCATC ACCCTGGCCG AGTACTACCG TGATATGGGC AAGCATATCG 250 

CCATGATGGC TGACTCGACG TCTCGCTGGG CCGAGGCGCT TCGTGAGATT 3 00 

TCGGGTCGTC TGGCGGAGAT GCCGGCGGAT GGTGGCTACC CCGCCTACQT 350 

CAGCGCTCGT CTCGCCTCCT TCTACGAGCG CGCCGGCCTC GTCACCTGCA 400 

TCGGCGGGCC GAAGCGCCAG GGCTCCGTCA CGATCGTCGG TGCCGT 446 



ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 

SEQUENCE DESCRIPTION: SEQ ID NO: 83 7 



2) INFORMATION FOR SEQ ID NO: 837 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 56 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) 



(xi) 

TTAGTCAGGC CCTCTCCAAG TACTCCAACT 
GGCTGCGGCG AGCGCGGTAA TGAGATGGCT 
AACTCTGACG ACCGTGATCG ATGGTCGCGA 
CCTGCCTCGT GGCAAACACT TCGAACATGC 
TCTATTTACA CCGGCATCAC CCTGGCCGAG 
GCATATTGCC ATGATGGCCG ACTCGACATC 
GTGAGATTTC CGGTCGTCTG GCGGAGATGC 
GCCTACCTCA GCGCTCGTCT CGCCTCCTTC 
CACCTGCATC GGCGGGCCGA AGCGCCAGGG 
CTGTGT 



CCGACTGCGT CATCTATGTC 50 

GAGGTGCTCA TGGATTTCCC 100 

GGAGTCCATC ATGAAGCGCA 150 

CAGTCGCAGC CCGCGAGGCC 200 

TACTACCGTG ATATGGGCAA 250 

TCGCTGGGCC GAGGCGCTTC 3 00 

CAGCCGATGG TGGCTACCCT 350 

TACGAGCGCG CCGGCCTCGT 4 00 

CTCCGTCACG ATCGTCGGTG 450 

456 



2) INFORMATION FOR SEQ ID NO: 838 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: MOU-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83 8 

AGGCCCTCTC CAAGTACTCC AACTCCGACT GCGTCATCTA CGTCGGCTGC 
GGCGAGCGCG GTAATGAGAT GGCCGAGGTG CTCATGGAGT TCCCGACCCT 1 
GACGACTGTG ATTGATGGCC GTGAGGAGTC GATCATGAAG CGGACCTGCC 1 

468 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TCGTGGCCAA CACCTCCAAC 
TACACTGGTA TCACCCTGGC 
CGCCATGATG GCTGACTCGA 
TTTCGGGTCG CCTGGCGGAG 
5 CTTAGTGCTC GTCTTGCCTC 
CATCGGTGGG CCGAAGCGCC 



ATGCCAGTCG CAGCCCGTGA 
CGAATACTAC CGTGATATGG 
CGTCTCGCTG GGCGGAGGCG 
ATGCCGGCTG ATGGTGGGTA 
CTTCTACGAG CGTGCCGGCC 
AGGGCTCCGT CACGATCGTC 



PCT/CA00/01150 



A6CCTCTATT 2 00 

GCAAGCATAT 2 50 

CTTCGTGAGA 3 00 

CCCCGCCTAC 350 

TCGTCACCTG 4 00 

GGTGCCGTGT 4 50 



10 2) INFORMATION FOR SEQ ID NO: 839 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 7 bases 

(B) TYPE: Nucleic acid 
15. (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei 

(B) STRAIN: EATRO 7 95 



25 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 839 



GCACTCTCGA AGTACTCGAA CAGTGACGCT GTTATTTACG TGGGTTGTGG 50 

TGAGCGTGGC AATGAGATGG CGGAGGTGCT CATGGACTTT CCCACCCTCA 100 

CCACCATTAT TGATGGACGT GAGGAGTCTA TCATGAAGCG CACATGCCTG 150 

GTGGCAAATA CTTCCAATAT GCCTGTTGCT GCTCGTGAGG CATCTATTTA 2 00 

3 0 CACTGGTATC ACCTTAGCTG AGTATTATCG GGATATGGGA AAACACATCG 2 50 

CCATGATGGC CGATTCAACC TCCCGCTGGG CTGAGGCTTT GCGTGAGATT 3 00 

TCCGGTCGTC TTGCTGAAAT GCCTGCAGAT GGAGGTTATC CCGCGTACCT 3 50 

CAGCGCCCGT TTGGCCTCCT TCTACGAGCG TGCTGGCCGT GTGACATGCA 4 00 

TCGGTGGGCC GAAGCGTGAA GGGTCTGTAA CAATTGT 43 7 

35 

2) INFORMATION FOR SEQ ID NO: 840 

4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 52 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma, cruzi 
50 (B) STRAIN: MM3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 840 

TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

55 TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GACGCCGCCG 150 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 2 00 

GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 2 50 

GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTCG 3 00 

60 CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACCTGAAG 350 

469 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
AAGGTTGGGT ACAACGTGGA GAAGGTGCGC 
GCAGGGCGAC AACATGATTG ACAAGTCGGA 
GCCCCACGCT GCTGGAGGCA CTCGACATGC 
AGCGACAAGC CGCTGCGCCT GCCGCTGCAG 
5 TATCGGCACC GTGCCGGTCG GTCGCGTGGA 
GCGACGTGGT GACGTTTGCG CCCGCCAACG 
ATTGAGATGC ACCACGAGCA GCTGGCCGAG 
CGGCTTCAAC GTGAAGAACG TGTCCGTGAA 
TGTGCGGCAA CTCGAAGAAC GACCCCCCAA 

10 GCGCAGGTGA TCATCCTGAA CCACCCCGGC 
GCCGGTGCTC GACTGCCACA CCTGCCACAT 
TCGAGTCCAA GATCGACCGC CGCTCCGGCA 
AAGTCGATCA AGTCCGGTGA CGCCGCCATG 
GCCCATGTGC GTGGAGGTGT TCAACGACTA 

15 CC 



PCT/CAOO/01150 

TTCATCCCCA TCTCCGGCTG A 00 

AAATATGCCG TGGTACAAGG 4 50 

TGGAGCCCCC GGTGCGCCCC 500 

GACGTGTACA AGATCGGCGG 550 

GACGGGCACG ATGAAGCCCG 600 

TGACGACGGA GGTGAAGTCG 650 

GCCACGCCCG GCGACAACGT 700 

GGACATCCGC CGTGGCAACG 750 

AGGAGGCGGC CGACTTCACG 800 

CAGATCGGCA ACGGCTATGC 850 

CGCGTGCAAG TTCGCCGAGA 900 

AGGAGCTTGA GAAGAACCCC 950 

GTGCGCATGG TGCCGCAGAA 1000 

CGCTCCTCTT GGCCGCTTTG 1050 

1052 



2) INFORMATION FOR SEQ ID NO: 841 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1061 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

2 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: CGL-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 841 

3 5 TGAAGGCTGA GCGCGAGCGC GGCATCACGA TCGACATCGC GCTCTGGAAG 50 

TTCGAGTCGC CCAAGTCTGT GTTCACGATC ATCGACGCCC CCGGCCACCG 100 

CGACTTCATC AAGAACATGA TCACGGGCAC GTCTCAGGCG GATGCCGCCG 150 

TCCTTGTCAT TGCGTCATCG CAGGGTGAGT TTGAGGCGGG CATCTCGAAG 20 0 

GACGGCCAGA CACGCGAGCA CGCGCTGCTC GCCTTCACGC TCGGCGTGAA 250 

4 0 GCAGATGGTT GTGTGCTGCA ACAAGATGGA CGACAAGTCG GTGAACTTTG 300 

CCCAGGAGCG CTACGATGAG ATTGTGAAGG AGGTGTCGGC GTACCTGAAG 3 50 

AAGGTTGGGT ACAACGTGGA GAAGGTGCGC TTCATCCCCA TCTCCGGCTG 4 00 

GCAGGGCGAC AACATGATTG ACAAGTCGGA AAATATGCCG TGGTACAAGG 450 

GCCCCACGCT GCTGGAGGCA CTCGACATGC TGGAGCCCCC GGTGCGCCCC 500 

4 5 AGCGACAAGC CGCTGCGCCT GCCGCTGCAG GATGTGTACA AGATCGGCGG 550 

TATCGGCACC GTGCCGGTCG GTCGCGTGGA GACGGGCACG ATGAAGCCCG 600 

GCGACGTGGT GACGTTTGCG CCCGCCAACG TGACGACGGA GGTGAAGTCG 650 

ATTGAGATGC ACCACGAGCA GCTGGCCGAG GCCACGCCCG GCGACAACGT 700 

CGGCTTCAAC GTGAAGAACG TGTCCGTGAA GGACATCCGC CGTGGCAACG 750 

50 TGTGCGGCAA CTCGAAGAAC GATCCCCCAA AGGAGGCGGC CGACTTCACG 800 

GCGCAGGTGA TCATCCTGAA CCACCCCGGC CAGATCGGCA ACGGCTATGC 850 

GCCGGTGCTT GACTGCCACA CCTGCCACAT CGCGTGCAAG TTCGCCGAGA 900 

TCGAGTCCAA GATCGACCGC CGCTCCGGCA AGGAGCTTGA GAAGAACCCC 950 

AAGTCGATCA AGTCCGGTGA CGCCGCCATG GTGCGCATGG TGCCGCAGAA 10 00 

55 GCCCATGTGC GTGGAGGTGT TCAACGACTA CGCTCCTCTT GGCCGCTTTG 1050 

CCGTGCGTGA C 1061 



60 2) INFORMATION FOR SEQ ID NO: 842 

470 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



.(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: PCU-1 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 842 



GCTGAAGGCT GAGCGCGAGC GCGGCATCAC GATCGACATC GCGCTCTGGA 50 

AGTTCGAGTC GCCCAAGTCT GTGTTCACGA TCATCGACGC CCCCGGCCAC 10 0 

CGCGACTTCA TCAAGAACAT GATCACGGGC ACGTCTCAGG CGGACGCCGC 150 

CGTCCTTGTC ATTGCGTCAT CGCAGGGTGA GTTTGAGGCG GGCATCTCGA 20 0 

2 0 AGGACGGCCA GACACGCGAG CACGCGCTGC TCGCCTTCAC GCTCGGCGTG 250 

AAGCAGATGG TTGTGTGCTG CAACAAGATG GACGACAAGT CGGTGAACTT 3 00 

CGCCCAGGAG CGCTACGATG AGATTGTGAA GGAGGTGTCG GCGTACCTGA 350 

AGAAGGTTGG GTACAACGTG GAGAAGGTGC GCTTCATCCC CATCTCCGGC 4 00 

TGGCAGGGCG ACAACATGAT TGACAAGTCG GAAAATATGC CGTGGTACAA 4 50 

25 GGGCCCCACG CTGCTGGAGG CACTCGACAT GCTGGAGCCC CCGGTGCGCC 500 

CCAGCGACAA GCCGCTGCGC CTGCCGCTGC AGGACGTGTA CAAGATCGGC 550 

GGTATCGGCA CCGTGC CGGT CGGTCGCGTG GAGACGGGCA CGATGAAGCC 600 

CGGCGACGTG GTGACGTTTG CGCCCGCCAA CGTGACGACG GAGGTGAAGT 650 

CGATTGAGAT GCACCACGAG CAGCTGGCCG AGGCCACGCC CGGCGACAAC 70 0 

3 0 GTCGGCTTCA ACGTGAAGAA CGTGTCCGTG AAGGACATCC GCCGTGGCAA 750 

CGTGTGCGGC AACTCGAAGA ACGACCCCCC AAAGGAGGCG GCCGACTTCA 8 00 

CGGCGCAGGT GATCATCCTG AACCACCCCG GCCAGATCGG CAACGGCTAT 850 

GCGCCGGTGC TCGACTGCCA CACCTGCCAC ATCGCGTGCA AGTTCGCCGA 90 0 

GATCGAGTCC AAGATCGACC GCCGCTCCGG CAAGGAGCTT GAGAAGAACC 950 

3 5 CCAAGTCGAT CAAGTCCGGT GACGCCGCCA TGGTGCGCAT GGTGCCGCAG 100 0 

AAGCCCATGT GCGTGGAGGT GTTCAACGAC TACGCTCCTC TTGGCCGCTT 1050 

TGCCGTGCGT GA 1062 



40 



50 



2) INFORMATION FOR SEQ ID NO: 843 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1057 bases 
45 (B) TYPE : Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 843 

TACATTGACA GCTGCATTGA CCAAGGTCTG TTCGATGGGT GGTCATGGCG 50 

AGTACACTCC TTATGAAGCA ATTGACCGTG CTCCTGAGGA GCGTAAACGT 100 

GGTATTACTA TAAATTCGAC ACATGTTGAA TATGAGACTA AGAACCGTCA 150 

60 TTACGGTCAC GTGGACTGTC CAGGTCACTC TGATTATGTG AAGAACATGA 200 

471 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

TATCTGGCGC TGCTCAGATG GATGGTGCCA TATTGGTTGT TTCT'FGTGTT 2^0 

GACGGTCCCA TGCCTCAGAC TAAGGAGCAC GTGTTGCTTG CTAAGCAGAT 3 00 

TGGTGTACCT CGTTTAGTTG TGTTTTTGAA CAAGCTTGAC ATGTTAGAGG 350 

ACTCTGAGCT ATTGGAGTTG GTGGAGTTAG AGGTTCGTGA GTTATTGAGT 4 00 

5 GAGTTTGGTT ACGACGGTGA CAACACGCCT ATCGTTCGTG GCAGTGCTAT 450 

AAAGGCATTG AACAGTAGTT CCGAGGCTGA CATTAAGCCA ATTCAGGATT 500 

TATTGGATGC GTGTGATGCC TTTTTACTGA CTCCAGAACG TAAGGATGAC 550 

ATGCCGCTCT TGGTTGCTAT TGACGATGTT CTTGCCATTC CTGGCAAGGG 600 

TACTGTTGTA ACCGGTAGGA TAGAGCAGGG CAAGATTCGT TGTGGTGACC 650 

10 CTATTGAGGT TTGCGCCGGT CCGAAGTCCG GCAAGAAGAC TGTGTGTGTT 700 

GGTCTTRARA TGTTCCGCAA GAGTCTCAGT RAGGGTATTG CTGGTGACCA 750 

GATTGGTGTT TTGCTCAAGG GTGTGAAGCG CGACRAGGTA GAGCGCGGGT 800 

TTGTATTGAT TCAACCCGGA AGTTACAAAT GTCACGGTGA ATTTGATGCT 850 

GACTTGTACG TGTTGACTAC GRAGGAAGGT GGGCGCAAGC ATCCGTTTGT 900 

15 GTCTAACTAC CGTCCTCAGG CGTTTATACG TACTGGAGAC GTTTGCTGCT 950 

CAGTTCATTT GGATRAGGGT GTTGAGATGG CAGCTCCTGG TGACAACGTG 100 0 

CGTTGCAAGA TCAAGTTACT TTATCCCATG CCTGTCCATG AAGGTTTACG 1050 
ATTTGCG 



20 



35 



2 ) INFORMATION FOR SEQ ID NO: 844 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 

(B) STRAIN: ATCC 50119 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 844 



1057 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

40 TTCATCAACA AAGTGGATAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 2 00 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 300 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 350 

45 TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 4 00 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CAACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

50 AATGGCGGCG ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACCGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGC GCTGACGTGA CAGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGGGA GGAGCTGAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGAAGAAAG AGGCGGAGAT GAAGGAGTTC 850 

55 GAGAGCAAGC TCGTCTGCAT GCCGGGCGAT AACCGCGAGC TGATCCTGAC 900 

GCTCGCGTAC CCGATGCCCA TTGAAAAGGG CCTGAAGTTC ACC 943 



60 2) INFORMATION FOR SEQ ID NO: 845 

472 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 9 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania amazonensis 

(B) STRAIN: ATCC 50131 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 845 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCATCT CTTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGTGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAAGAGA 2 00 

20 CGCCCATCGT GCGCGGCTCG GCCCTCAAAG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AATGGATCCC 3 00 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATTGAGCACG 3 50 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 4 50 

25 CGCCAAGAAG TCGACGGTGA GGGTGACGGG CAT CGAGATG TACCACAAGA 50 0 

CGCTGAGTGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CGATCAGTCT CTCCAAGGAC AACGTTGAAC GCGGCATGGT 600 

AATGGCGGCA ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 700 

30 CACTACCGCC CGCAGCTCTT CTTCCATTGT GCTGACGTGA CGGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 800 

GCCGCGGCCC CGAGGAGGAC AAGCAGAAGG AGGCGGAGAT GAAAGAGTTC 850 

GAGAGCAAGC TCGTCTGCAT GCCGGGCGAC AACCGCGAGC TGATCCTGAC 900 

GCTGGCGTAC CCGATGCCCA TTGACAAGGG TCTGAAGTT 93 9 

35 



2) INFORMATION FOR SEQ ID NO: 846 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 945 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 846 



55 



60 



CATTGTGGTG 
TCCTGATCTG 
AAGGTGGACA 
GCGCGAGCAG 
TGCGCGGCTC 
AACATTCTCG 
GCGCAACACA 



GCGGCCACCG 
CTCGCAGATC 
TGACGGACGA 
CTGGAGAAAT 
AGCCCTCAAA 
AACTGGTGCG 
GACAAGCCTT 



ACGGCGTCAT 
GGGCTTCCGG 
GGACACGTGC 
ACAAGTTTCC 
GCCGTCGAGG 
GAAGTGTGAC 
TCCTTATGGC 
4 



GCCGCAGACA 
CGCTCGTAGG 
GACCTGGTGG 
GGCGGAGGAG 
GCGATGCGAA 
GAGTGGATCC 
CATCGAGCAC 
73 



CGCGAGCACC 50 

GTTCATCAAC 100 

ACATGGAGCT 150 

ACGCCAATCG 2 00 

GTACGAGGAG 250 

CTGACCCGCC 300 

GTTTACGAGA 350 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

TCGGCAAGGA CAAGAAGAGC GTTGTCGTGA CCGGCCGCGT CGATckGGGC 4 0j^ 

ATTCTGAAGC TCAACACAGA CGCCGAGCTG GCCGGCTTCA GCTCCAAGAA 4 50 

GTCGACGGTG AGGGTGACGG GCATCGAGAT GTACCACAAG ACGCTGAGCG 500 

AGTGCATGCC TGGTGACTCC GTCGGCGTCA GCATTGTCGG GACCGGCGAC 550 

5 ACGACCAGTC TATCCAAGGG CAACGTGGAA CGCGGCATGG TGATGGCGGC 600 

GACGGGTAGC ACGAACCTGT ACAACAAGGT GAAGGCGCAG GTGTACGTGC 650 

TGACGAAGGA TGAGGGCGGC CGCCACACTG GCTTTAGTCC TCACTACCGC 700 

CCGCAGCTCT TCTTCCATTG TGCTGACGTG ACGGCGGACA TGAGCTTCCC 750 

GGAGGCGGAG AAGCACCGCG AAGAGCTCAA CAAGAAATTC GGCCGCGGCC 800 

10 CCGAGGAGGA CAAGAAGAAA GAGGCAGCGA TGAAGGAGTT CGAGAGCAAG 850 

CTCGTCTGCA TGCCGGGCGA TAACCGCGAG CTGATCCTGA CGCTGGCGTA 900 

CCCGATGCCC ATTGAAAAGG GTCTGAAGTT CACCATCCGT GAGGG 94 5 



15 



25 



2) INFORMATION FOR SEQ ID NO: 847 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 93 9 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania infantum 

(B) STRAIN: MOU 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 7 

GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

GCACCTCCTG ATCTGCTCGC AGATCGGGCT TCCGGCGCTC GTAGGGTTCA 100 

TCAACAAGGT GGACATGACG GACGAGGACA CGTGCGACCT GGTGGACATG 150 

3 5 GAGCTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACGCC 2 00 

AATCGTGCGC GGCTCAGCCC TCAAAGCCGT CGAGGGCGAT GCGAAGTACG 250 

AGGAGAACAT TCTCGAACTG GTGCGGAAGT GTGACGAGTG GATCCCTGAC 3 00 

CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 3 50 

CGAGATCGGC AAGGACAAGA AGAGCGTTGT CGTGACCGGC CGCGTCGATC 4 00 

4 0 AGGGCGTTCT GAAGCTCAAC ACAGACGCCG AGCTGGCCGG CTTCAGCTCC 450 

AAGAAGTCGA CGGTGAGGGT GACGGGCATC GAGATGTACC ACAAGACGCT 500 

GAGCGAGTGC ATGCCTGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 550 

GCGACACGAC CAGTCTATCC AAGGGCAACG TGGAACGCGG CATGGTGATG 600 

GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 650 

45 CGTGCTGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTT AGTCCTCACT 700 

ACCGCCCGCA GCTCTTCTTC CATTGTGCTG ACGTGACGGC GGACATGAGC 750 

TTCCCGGAGG CGGAGAAGCA CCGCGAAGAG CTCAACAAGA AATTCGGCCG 8 00 

CGGCCCCGAG GAGGACAAGA AGAAAGAGGC AGCGATGAAG GAGTTCGAGA 850 

GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 900 

50 GCGTACCCGA TGCCCATTGA AAAGGGTCTG AAGTTCACC 93 9 



2 ) INFORMATION FOR SEQ ID NO: 848 

55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
60 (D) TOPOLOGY: Linear 

474 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Leishmania enriettii 

(B) STRAIN: ATCC 50120 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84 8 

10 CACCGACGGC GTCATGCCGC AGACACGGGA GCACCTGCTC ATCTGCTCGC 50 

AGATCGGGCT GCCGGCGCTT GTAGGGTTCA TCAATAAAGT TGACATGACG 100 

GACGAGGATA CGTGCGACCT CGTGGACATG GAGGTGCGGG AACAGCTGGA 150 

GAAGTACAAG TTTCCGGCCG AGGAGACGCC CATCGTACGT GGCTCGGCCC 2 00 

TCAAGGCCCT CGAGGGGGAT GCGCAATACG AGGGGAGTAT TCTCGAGCTG 250 

15 GTGCGAAAGT GCGACGAGTG GATCCCCGAC CCGCCGCGCA ACACCGAAAA 300 

GCCTTTCCTC ATGGCTATCG AGCACGTTTA CGAGCTCGGC AAAGACAAGA 350 

AGAGCGTCAT CGTTACCGGC CGCGTCGATC AAGGTGTGCT GAAGCTCAAC 400 

ACAGACGCCG AGCTGGCCGG CTTCAGCGCC AAGAAGGCGA CAGTCAAAGT 450 

GACGGGCATC GAGATGTATC ACAAGACACT CAATGAGTGC ATGCCCGGCG 500 

2 0 ACTCTGTCGG TGTCAGCATC GTCGGTACCG GTGACACGAC CAGCTTATCC 550 

AAGGATAATG TTGAGCGCGG TATGGTAATG GCGGCAACGG GTAGCACGAA 600 

CCTGTACAAC AAGCTGAAGG CGCAGGTTTA CGTGCTGACA AAGGAGGAGG 650 

GTGGCCGCCA CACCGGGTTC AGCCCCCACT ACCGCCCGCA GCTCTTCTTC 700 

CACTGCGCTG ACGTGACCGC AGACATGAGC TTCCCGGAGG CGGAGAAGTA 750 

25 CCGCGAGGAG CTCAACAAGA AGTTCGGCCG TGGCCCTGAG GAGGACAAGA 800 

AGAAAGAGGC GGAGATGAAG GAGTTCGAAA GCAAACTTGT CTGCATGCCA 850 

GGCGATAACC GCGAGCTGAT CCTAACTCTG GCGTACCCGA TGCCCATCGA 900 

CAAGGGCCTG AAGTTCACCA TCCGTGAGGG CGG 933 



30 



45 



2) INFORMATION FOR SEQ ID NO: 84 9 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 943 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania gerbilli 

(B) STRAIN: ATCC 50121 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 849 



CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

GCGAGCACCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

50 TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGCGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAGGAGA 200 

CACCAATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 250 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AGTGGATCCC 3 00 

CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATCGAGCACG 350 

55 TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 400 

GATCAGGGCG TGCTGAAGCT CAACACGGAC GCCGAGCTGG CCGGCTTCAG 450 

CGCCAAGAAG TCGACGGTGA GGGTGACGGG CATTGAGATG TACCACAAGA 500 

CGCTGAGCGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

ACCGGCGACA CGACCAGTCT GTCCAAGGAC AACGTGGAAC GCGGCATGGT 600 

60 AATGGCGGCG ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

475 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC 
CACTACCGCC CGCAGCTCTT CTTCCATTGC 
GAGCTTCCCG GAGGCGGAGA AGCACCGCGA 
GCCGCGGCCC CGAGGAGGAC AAGAAGAAAG 
5 GAGAGCAAGC TCGTCTGCAT GCCGGGCGAT 
GCTGGCGTAC CCGATGCCCA TTGAAAAGGG 



PCT/CA00/01150 

GCCACACTGG CTTCA&CCCC 

GCTGACGTGA CAGCGGACAT 75 0 

GGAGCTCAAC AAGAAATTCG 800 

AGGCGGAGAT GAAGGAGTTC 850 

AACCGCGAGC TGATCCTGAC 900 

TCTGAAGTTC ACC 943 



10 2) INFORMATION FOR SEQ ID NO: 850 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 918 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania major 

(B) STRAIN: ATCC 50122 



2 5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 850 



GGCATCATTG TGGTGGCGGC CACCGACGGC GTCATGCCGC AGACACGCGA 50 

ACACCTCCTG ATCTGCTCGC AAATTGGCCT TCCGGCGCTC GTAGGGTTCA 100 

TCAACAAAGT GGACATGACG GACGAGGACA CGTGTGACCT GGTGGACATG 150 

GAGGTGCGCG AGCAGCTGGA GAAATACAAG TTTCCGGCGG AGGAGACACC 2 00 

30 AATCGTGCGC GGCTCGGCCC TCAAGGCCGT CGAGGGCGAC GCGAAGTACG 2 50 

AGGAGAACAT CCTCGAACTG GTGCGGAAGT GCGACGAGTG GATCCCCGAC 3 00 

CCGCCGCGCA ACACAGACAA GCCTTTCCTT ATGGCCATCG AGCACGTTTA 3 50 

CGAGATCGGC AAGGACAAGA AGAGCGTCAT CGTGACCGGC CGCGTCGATC 4 00 

AGGGCGTGCT GAAGCTCAAC ACAGACGCCG AGCTGGCCGG CTTCAGCGCC 450 

3 5 AAGAAGTCGA CGGTGAGGGT GACGGGCATT GAAATGTACC ACAAGACGCT 500 

GAGCGAGTGC ATGCCCGGTG ACTCCGTCGG CGTCAGCATT GTCGGCACCG 550 

GCGACACGAC CAGTCTGTCC AAGGACAACG TGGAGCGCGG CATGGTAATG 6 00 

GCGGCGACGG GTAGCACGAA CCTGTACAAC AAGGTGAAGG CGCAGGTGTA 650 

CGTGCTGACG AAGGATGAGG GCGGCCGCCA CACTGGCTTC AGCCCCCACT 7 00 

40 ACCGCCCGCA GCTCTTCTTC CATTGCGCTG ACGTGACAGC GGACATGAGC 750 

TTCCCGGAGG CGGAGAAGCA CCGCGAGGAG CTCAACAAGA AATTCGGCCG 800 

CGGCCCCGAG GAGGACAAGA AGAAAGAGGC GGAGATGAAG GAGTTCGAGA 850 

GCAAGCTCGT CTGCATGCCG GGCGATAACC GCGAGCTGAT CCTGACGCTG 900 

GCGTACCCGA TGCCCATT 918 

45 



2) INFORMATION FOR SEQ ID NO: 851 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 
60 (B) STRAIN: ATCC 50156 

476 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 851 

CGGTGGCATC ATTGTGGTGG CGGCCACCGA CGGCGTCATG CCGCAGACAC 50 

5 GCGAGCATCT CCTGATCTGC TCGCAGATTG GGCTTCCGGC GCTCGTAGGG 100 

TTCATCAACA AAGTGGACAT GACGGACGAG GACACGTGCG ACCTGGTGGA 150 

CATGGAGGTG CGTGAGCAGC TGGAGAAATA CAAGTTTCCG GCGGAAGAGA 200 

CGCCCATCGT GCGCGGCTCG GCCCTCAAGG CCGTCGAGGG CGACGCGAAG 2 50 

TACGAGGAGA ACATCCTCGA ACTGGTGCGG AAGTGCGACG AATGGATCCC 3 00 

10 CGACCCGCCG CGCAACACAG ACAAGCCTTT CCTTATGGCC ATTGAGCACG 350 

TTTACGAGAT CGGCAAGGAC AAGAAGAGCG TCATCGTGAC CGGCCGCGTC 4 00 

GATCAGGGCG TGCTGAAGCT CAACACAGAC GCCGAGCTGG CCGGCTTCAG 4 50 

CGTCAAGAAG TCGACGGTGA GGGTGACGGG CATCGAGATG TACCACAAGA 500 

CGCTGAGTGA GTGCATGCCC GGTGACTCCG TCGGCGTCAG CATTGTCGGC 550 

15 ACCGGCGACA CGATCAGTCT CTCCAAGGAC AACGTTGAAC GCGGCATGGT 600 

AATGGCGGCA ACGGGTAGCA CGAACCTGTA CAACAAGGTG AAGGCGCAGG 650 

TGTACGTGCT GACGAAGGAT GAGGGCGGCC GCCACACTGG CTTCAGCCCC 700 

CACTACCGCC CGCAGCTCTT CTTCCATTGT GCTGACGTGA CGGCGGACAT 750 

GAGCTTCCCG GAGGCGGAGA AGCACCGCGA GGAGCTCAAC AAGAAATTCG 800 

2 0 GCCGCGGCCC CGAGGAGGAC AAGCAGAAGG AGGCGGAGAT GAAAGAGTTC 8 50 

GAGAGCAAGC TCGTCTGCAT GCCGGGCGAC AACCGCGAGC TGATCCTGAC 900 

GCTGGCGTAC CCGATGCCCA TTGAGAAGGG TCTGAAGTT 93 9 



25 



2) INFORMATION FOR SEQ ID NO: 852 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 912 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 

(vi.) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tarentolae 

(B) STRAIN: MOU-2 

4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 852 

TCATTGTGGT GGCCGCCACC GACGGCGTCA TGCCGCAAAC ACGGGAGCAC 50 

CTTTTGATCT GCTCGCAGAT CGGGCTGCCG GCGCTCGTAG GGTTCATCAA 100 

CAAAGTGGAC ATGACAGACG AAGACACGTG CGACCTGGTA GACCTGGAGG 150 

4 5 TGCGTGAGCA GCTGGAGAAG TACAAGTTTC CGGCAGAGGA AACACCAATC 2 00 

GTGCGTGGCT CGGCCCTCAA GGCCGTTGAG GGCGATGCAA AGTACGAGGA 2 50 

GAACATCCTC GAACTGGTGC GGAAGTGCGA CGAGTGGATC CCAGACCCGC 3 00 

CACGCAATAC GGACAAGCCT TTCCTTATGG CCATTGAACA CGTGTACGAG 3 50 

ATCGGCAAGG ATAGGAAAAG CGTCATCGTA ACCGGCCGCG TCGATCAAGG 4 00 

50 TGTGCTGAAG CTGAACACAG ACGCCGAGCT GGCCGGCTTC AGCGCCAAGA 450 

AGTCGACGGT GAAAGTGACG GGCATTGAGA TGTACCACAA GACACTGACA 500 

GAGTGCATGC CCGGCGACTC TGTCGGCGTC AGCATTGTGG GCACTGGYGA 550 

CACGACCAGC CTCTCTAAGG ACAATGTTGA GCGTGGCATG GTACTGGCCG 600 

CTACGGGTAG CACGAACCTG TACAACAAAG TAAAGGCGCA GGTGTATGTA 650 

55 CTCACGAAGG ATGAGGGCGG CCGCCACACC GGCTTCAGCC CCCACTACCG 700 

TCCGCAGCTC TTCTTCCACT GCGCTGACGT AACGGCGGAC ATGAGCTTCC 750 

CGGAGGCGGA GAAGCACCGC GAGGAACTCA ATAAGAAATT CGGCCGCGGC 8 00 

CCCGAGGAGG ACAAGAAAAA GGAGGCGGAG ATGAAGGAGT TCGAGAGCAA 850 

GCTGGTCTGC ATGCCAGGCG ATAACCGCGA GCTGATCCTG ACATTGGCGT 900 

60 ACCCGATGCC TA 912 

477 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 853 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 936 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: MM3 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 853 

ATTCTTGTGG TGGCAGCTAA CGACGGATGC ATGCCGCAGA CGCGTGAGCA 50 

CCTGCTTATT TGTTCGCAGA TTGGCCTTCC TGCTCTTGTA TGCTTTATCA 100 

ATAAGTGTGA CATGATGCAA GGGCAGGAGG AAATGATTGA ACTTGTTGAA 150 

ATGGAGGTAC GTGAACTTTT GGAGAAGTAC AAGTTCCCTG CGGAGGAGAC 2 00 

GCCATTTGTG CGGGGGTCTG CGGTGAAGGC ATTGGAGGGT GATGCTGAAA 250 

ATGAAGGAAA GATTTTGGAG CTTGTAAAAA AATGTGATGA ATGGATTCCC 3 00 

GACCCACCGC GTGCCATTGA AAAACCGTTC CTTATGGCCA TTGAGCACGT 350 

TTTTGAGGTT GGAAAGGATA AGAAGGCCGT TGTTGTGAGC GGGCGTGTGG 4 00 

ACCAGGGGCA GTTGAAGGTC GGCGCAGATG CAGAACTTTC CGGGTTTAGC 4 50 

GCAAAGAAGC TGACGGTGAA GGTTGCTAGC ATCGAAATGT ACCATAAAAT 5 00 

TCTGGAGGAT TGCATGCCTG GTGACTCTGT TGGCGCGAAG ATCGTTGGCA 550 

GCGGTGAAAC AGTGAACCTG TCGAAGGAAA ATGTGGAACG CGGCATGGTA 600 

CTCTCCGCAC CAGGTGCAAC GACACTGTTC AACAAGGTCC GCGCGCAGGT 650 

GTACGTGTTG ACAAAGGAAG AAGGCGGTCG TCACACAGCC TTTAGTCCTC 700 

ACTATCGTCC GCAGCTTTTC TTCCACTGTG CTGATGTCAC GGCAGATATT 750 

AACTTCCCGG AAAGCGAGAA GCTTGCAGGG GAGCTGAACA AAAAGTATGG 8 00 

CCGTGATGCG GCGGAACAGA AGAAGAAGGA GGCAGAACTG AAAGAGTTTG 850 

AAAAGACGCT TGTCTGCATG CCTGGTGATA ACCGCGAACT CCTGCTCACC 900 

CTTGCCTATC CAATGCCAAT GGAAAAGGGA CTCAAG 93 6 



2) INFORMATION FOR SEQ ID NO: 854 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 934 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma cruzi 

(B) STRAIN: PCU-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 854 

CGGCATTCTT GTGGTGGCAG CTAACGACGG ATGCATGCCG CAGACGCGTG 50 

AGCACCTGCT TATTTGTTCG CAGATTGGCC TTCCTGCTCT TGTATGCTTT 100 

ATCAATAAGT GTGACATGAT GCAAGGGCAG GAGGAAATGA TTGAGCTTGT 150 

478 
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TGAAATGGAG GTACGTGAAC 
AGACGCCATT TGTGCGGGGG 
GAAAATGAAG GAAAGATTTT 
TCCCGACCCA CCGCGTGCCA 
5 ACGTTTTTGA GGTTGGAAAG 
GTGGACCAGG GGCAGTTGAA 
TAGTGCAAAG AAGCTGACGG 
AAATTCTGGA GGATTGCATG 
GGCAGCGGTG AAACAGTGAA 

10 GGTACTCTCC GCACCAGGTG 
AGGTGTACGT GTTGACAAAG 
CCTCACTATC GTCCGCAGCT 
TATTAACTTC CCGGAAAGCG 
ATGGCCGTGA CGCGGCGGAA 

15 TTTGAAAAGA CGCTTGTCTG 
CACCCTTGCC TATCCAATGC 



TTTTGGAGAA GTACAAGTTC 
TCTGCGGTGA AGGCATTGGA 
GGAGCTTGTA AAAAAATGTG 
TTGAAAAACC GTTCCTTATG 
GATAAGAAGG CCGTTGTTGT 
GGTCGGCGCA GATGCAGAAC 
TGAAGGTTGC TAGCATCGAA 
CCTGGTGACT CTATTGGCGC 
CCTGTCGAAG GAAAATGTGG 
CAACGACACT GTTCAACAGG 
GAAGAAGGCG GTCGTCACAC 
TTTCTTCCAC TGTGCTGATG 
AGAAGCTTGC AGGGGAGCTG 
CAGAAGAAGA AGGAGGCAGA 
CATGCCTGGT GATAACCGCG 
CAATGGAAAA GGGA 



PCT/CA00/01150 

CCTGCGG^GG 200 

GGGTGATGCT 250 

ATGAATGGAT 300 

GCCATTGAGC 350 

GAGCGGGCGT 400 

TTTCCGGGTT 450 

ATGTACCATA 500 

GAAGATCGTT 550 

AACGCGGCAT 600 

GTCCGCGCGC 650 

AGCCTTTAGT 700 

TCACGGCGGA 750 

AACAAAAAGT 80 0 

ACTGAAAGAG 850 

AACTCCTGCT 900 

934 



20 2 ) INFORMATION FOR SEQ ID NO: 855 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 937 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma, cruzi 

(B) STRAIN: CGL-1 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 855 



TGGCGGCATT CTTGTGGTGG CAGCTAACGA CGGATGCATG CCGCAGACGC 50 

GTGAGCACCT GCTTATTTGT TCGCAGATTG GCCTTCCTGC TCTTGTATGC 10 0 

TTTATCAATA AGTGTGACAT GATGCAAGGG CAGGAGGAAA TGATTGAACT 150 

TGTTGAAATG GAGGTACGTG AACTTTTGGA GAAGTACAAG TTCCCTGCGG 2 00 

40 AGGAGACGCC ATTTGTGCGG GGGTCTGCGG TGAAGGCATT GGAGGGTGAT 250 

GCTGAAAATG AAGGAAAGAT TTTGGAGCTT GTAAAAAAAT GTGATGAATG 3 00 

GATTCCCGAC CCACCGCGTG CCATTGAAAA ACCGTTCCTT ATGGCCATTG 350 

AGCACGTTTT TGAGGTTGGA AAGGATAAGA AGGCCGTTGT TGTGAGCGGG 40 0 

CGTGTGGACC AGGGGCAGTT GAAGGTCGGC GCAGATGCAG AACTTTCCGG 450 

4 5 GTTTAGCGCA AAGAAGCTGA CGGTGAAGGT TGCTAGCATC GAAATGTACC 50 0 

ATAAAATTCT GGAGGATTGC ATGCCTGGTG ACTCTGTTGG CGCGAAGATC 550 

GTTGGCAGCG GTGAAACAGT GAACCTGTCG AAGGAAAATG TGGAACGCGG 60 0 

CATGGTACTC TCCGCACCAG GTGCAACGAC ACTGTTCAAC AAGGTCCGCG 650 

CGCAGGTGTA CGTGTTGACA AAGGAAGAAG GCGGTCGTCA CACAGCCTTT 700 

50 AGTCCTCACT ATCGTCCGCA GCTTTTCTTC CACTGTGCTG ATGTCACGGC 750 

AGATATTAAC TTCCCGGAAA GCGAGAAGCT TGCAGGGGAG CTGAACAAAA 800 

AGTATGGCCG TGATGCGGCG GAACAGAAGA AGAAGGAGGC AGAACTGAAA 850 

GAGTTTGAAA AGACGCTTGT CTGCATGCCT GGTGATAACC GCGAACTCCT 90 0 

GCTCACCCTT GCCTATCCAA TGCCAATGGA AAAGGGA 937 

55 

2) INFORMATION FOR SEQ ID NO: 856 

60 (i) SEQUENCE CHARACTERISTICS: 

479 



SUBSTITUTE SHEET (RULE 26) 
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(A) LENGTH: 88 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bigemina 
10 (B) STRAIN: Suarez-2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 856 

CTTATTATGG AGCTGATCAA CAACGTCGCG AAGAAGCACG GTGGTTTCTC 50 

15 CGTGTTCGCC GGCGTCGGCG AGCGCACCAG GGAGGGCAAC GAGCTGTACC 100 

ACGAGATGAT GGAGACCGGC GTCATCAAGC GCCGCCAGCT GGATGACGGC 150 

ACGTTCGACT TCTCCGGCTC CAAGGCCGCG CTGGTGTACG GCCAGATGAA 200 

CGAGCCGCCA GGTGCCAGGG CGCGTGTTGC CCTCACTGGC CTGACGGTGG 250 

CCGAGTACTT CCGTGATGAG GACGGCCAGG ACGTGCTGCT CTTCATCGAC 300 

20 AACATCTACC GTTTCACCCA GGCTGGTTCT GAGGTGAGTG CCCTTTTGGG 350 

GCGCATCCCG TCCGCCGTCG GTTACCAGCC GACCCTCGCC ACCGACCTTG 400 

GCGCGCTGCA GGAGCGTATC ACGACGACCA ACAAGGGCTC CATCACCTCC 450 

GTGCAGGCCG TCTACGTGCC GGCCGACGAT ATCACCGACC CGGCGCCTGC 500 

GACCACCTTC ACCCATCTGG ACGCGACCAC TGTGCTCTCC CGTTCCATCG 550 

25 CCGAGCTGGG TATCTACCCC GCCGTCGACC CGCTCGACTC CACCTCGCGT 600 

ATGCTGTCCG CGAACATCGT CGGCGAGGAG CAGTACAACG TGGCGCGTGG 650 

CGTGCAGAAA ATACTGCAGG ACTACAAATC GCTGCAGGAT ATCATCGCCA 700 

TCCTGGGTAT GGACGAGCTG TCTGAGCAGG ACAAGTTCGT CGTCGCGCGT 750 

GCGCGCAAGG TTCAGCGTTT CCTATCCCAG CCCTTCCAGG TGGCTGAGGT 800 

30 ATTCACCGGC AAGCCCGGAC GTTTCGTCGA GCTGCAGGAC ACCATCAGCG 850 

GCGTCAAGGA GATTTTGGAC GGCGAGTGCG ACGACATG 888 



35 2) INFORMATION FOR SEQ ID NO: 857 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia bovis 

(B) STRAIN: Suarez-3 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 857 



TGATTATGGA ATTGATCAAC AATGTCGCCA AGAAACACGG TGGGTTCTCC 50 

GTGTTCGCTG GTGTTGGTGA ACGTACGAGG GAAGGTAACG AACTGTACCA 100 

TGAAATGATG GAAACGGGTG TCATCAAGCG CCGTCAACTG GAAGACGGAA 150 

CATTTGACTT CTCGGGCTCT AAAGCTGCTT TGGTGTACGG ACAAATGAAC 200 

55 GAACCACCAG GTGCTAGAGC CCGTGTTGCA CTCACGGGAT TGACCGTTGC 250 

CGAGTATTTC CGTGATGAAG AGGGGCAGGA TGTGCTACTC TTCATCGATA 300 

ACATCTACCG TTTCACCCAG GCCGGTTCCG AAGTGAGTGC GCTGTTAGGA 350 

AGAATTCCAT CCGCCGTGGG TTATCAACCT ACATTGGCCA CTGATCTCGG 400 

AGCACTCCAG GAACGCATTA CTACAACCAA CAAGGGTTCG ATTACATCAG 450 

60 TCCAGGCAGT ATACGTCCCA GCCGATGATA TCACTGATCC CGCTCCAGCT 500 

480 
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ACCACTTTCT CGCACTTGGA 
GGAGTTGGGT ATTTACCCTG 
TGCTGTCGGC CAACATTGTA 
GTACAGAAAA TTTTACAGGA 
TCTGGGTATG GACGAGCTGT 
CCCGTAAGGT ACAGCGTTTC 
TTCACCGGCA AGCCTGGGAG 
TGTCAAGGAA ATCTGGAAGG 



TGCCACTACA 
CGGTCGACCC 
GGACAGGAAC 
CTACAAATCA 
CTGAGCAGGA 
CTGTCTCAGC 
GTTCGTTGAA 
TGAGTGTGAC 



GTGCTTTCTC GTTCAATTGC 

GCTTGACTCA ACGTCACGTA 

AGTACGATGC CGCACGTGGT 

CTGCAGGATA TCATTGCCAT 

CAAGTTCGTT GTAGCACGCG 

CGTTCCAAGT GGCTGAGGTG 

CTACAGGATA C CAT C AG CGG 
GATA 



550 
600 
650 
700 
750 
800 
850 
884 



2 ) INFORMATION FOR SEQ ID NO: 858 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 871 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microtti 

(B) STRAIN: Persing-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 858 

TGGAACTGAT TAATAATGTG GCCAAAAAGC ATGGCGGTTA CTCTGTTTTT 50 

GCAGGTGTAG GTGAAAGGAC GAGGGAGGGT AATGAATTGT ACCATGAAAT 100 

GATGGAGACA GGTGTTATAA AGAAAAAGGC ACTAGGTGGT GGGAAGTTTG 15 0 

ATTTCAGTGG ATCTAAAGCA GCGCTGGTCT ATGGACAAAT GAACGAGCCA 2 00 

CCTGGGGCCC GTGCTAGAGT GGCACTAACT GGATTAACAG TCGCAGAATA 250 

TTTCCGTGAC GAACAAGGAC AAGACGTGTT GTTGTTTATT GATAATATTT 300 

ACCGATTTAC TCAGGCAGGG TCTGAGGTTT CAGCCTTGCT AGGCCGTATA 3 50 

CCTTCAGCTG TGGGATACCA GCCTACATTG GCAACAGATC TTGGCTGTTT 4 00 

ACAAGAACGA ATTACTACGA CCAAATCTGG TTCAATCACC AGTGTACAAG 4 50 

CTGTGTATGT GCCAGCAGAT GATATTACTG ATCCAGCGCC TGCCACAACT 500 

TTTACTCACT TGGACGCTAC TACTGTACTT AGCAGGCCAA TTGCTGAACT 550 

CGGTATTTAT CCAGCGGTAG ACCCGTTGGA TTCAACAAGC CGTATGCTAA 600 

GCGCGAACAT TGTGGGAAAT GAACACTATA GTGTAGCCCG TTCCGTGCAG 650 

AAGATACTGC AAGATTACAA ATCGCTTCAG GACATTATTG CCATTTTGGG 70 0 

TATGGATGAA CTGTCGGAAC AAGACAAAAA TATAGTAGCC CGAGCAAGGA 750 

AGATGCAAAG GTTCTTATCA CAGCCATTCC AAGTGGCGGA AGTTTTTACT 800 

GGTAAACCGG GAAGATTTGT GGAATTGGAA GATACAATTG CCGGGGCACG 850 

AGATATAATT GCGGGTAATT G 871 



2) INFORMATION FOR SEQ ID NO: 859 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1255 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 

481 
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WO 01/23604 

(B) STRAIN: ATCC 50126 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 859 

5 TGACGGCGCT GGACGTGACG GAGGACCTCG GCCGCGATGA GCCGCTGACG 50 

CTGGAGATCG TGCAGCACTT GGATGCGAAC ACCGGCCGCT GCATTGCGAT 100 

GCAGACGACG GACCTGCTGA AGCTGAAGTC GAAGGTTGTG TCGACCGGCG 150 

GCAACATCTC CGTGCCGGTG GGCCGCGAGA CACTGGGCCG CATCTTCAAC 2 00 

GTGCTGGGCG ACGCGATTGA CCACCGCGGC CCCGTGTGCG AGAAGATGCG 2 50 

10 CATGGCGATC CACGCCGAGG CGCCGAAGCT GGCGGACCAG GCTGCGGAGG 3 00 

ACACGATCCT GACGACCGGC ATCAAGGTGA TCGACCTGAT TCTGCCCTAC 350 

TGCAAGGGCG GCAAGATCGG CCTGTTCGGC GGTGCCGGTG TGGGCAAGAC 4 00 

TGTGATCATC ATGGAGCTGA TCAACAACGT CGCGAAGGGG CACGGCGGCT 4 50 

TCTCCGTGTT CGCCGGCGTT GGCGAGCGCA CGCGCGAGGG CACGGACCTG 500 

15 TACCTGGAGA TGATGCAGTC AAAGGTGATT GACCTGAAGG GCGAGTCGAA 550 

GTGCGTGCTT GTGTACGGGC AGATGAACGA GCCCCCGGGT GCGCGCGCGC 600 

GCGTTGCGCA GTCTGCGCTG ACGATGGCCG AGTACTTCCG CGACGTGGAG 650 

GGCCAGAACG TGCTGCTGTT CATCGACAAC ATCTTCCGCT TCACGCAGGC 700 

GAACTCCGAG GTGTCTGCGC TGCTGGGCCG CATCCCGGCC GCCGTGGGTT 750 

20 ACCAGCCGAC GCTTGCGGAG GATCTTGGCA TGCTGCAGGA GCGCATTACG 800 

TCGACGACGA AGGGATCGAT TACGTCTGTG CAGGCTGTGT ACGTGCCTGC 850 

GGATGATATC ACGGACCCCG CGCCCGCGAC GACGTTCTCG CACCTGGACG 900 

CGACGACTGT GCTGGACCGC GCGGTGGCGG AGTCGGGCAT CTACCCTGCC 950 

GTGAACCCGC TGGAGTGCGC GTCGCGCATC ATGGACCCCG ATGTGATCGA 1000 

2 5 CGTGGACCAT TACAACGTTG CACAGGATAT CGTCCAGATG CTGACCAAGT 1050 

ACAAGGAGCT GCAGGACATC ATTGCGGTGC TTGGCATCGA CGAGCTGAGC 1100 

GAGGAGGACA AGGTCGTGGT GGACCGCGCG CGCAAGGTGA CGCGGTTCCT 1150 

GTCGCAGCCG TTCCAGGTTG CGGAGGTGTT CACCGGCATG ACGGGCCACT 12 00 

ACGTGCAGCT GAGCGACACG GTGGAGTCGT TCTCTGGCCT GCTGATGGGG 12 50 

30 TCGTA 1255 



2) INFORMATION FOR SEQ ID NO: 860 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Leishmeniisi mexicana 

(B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 860 

50 CTCGGAGGGC GTCCCGCCCG TGCTGACGGC GCTGGATGTG ACGGAGGACC 50 

TTGGCCGCGA TGAGCCGCTG ACGCTGGAGA TCGTGCAGCA CCTGGACGCG 100 

AACACCGGCC GCTGCATTGC GATGCAGACG ACGGACCTGC TGAAGCTGAA 150 

GTCGAAGGTT GTGTCGACCG GCGGCAACAT CTCTGTGCCG GTGGGCCGTG 2 00 

AGACGCTGGG CCGCATCTTC AACGTGCTGG GCGACGCGAT CGACCAGCGC 2 50 

55 GGCCCCGTGG GTGAGAAGAT GCGCATGGCG ATCCACGCCG AGGCCCCGAA 3 00 

GCTGGCGGAT CAGGCCGCGG AGGACACGAT CCTGACGACC GGCATCAAGG 3 50 

TGATCGACCT GATTCTGCCC TACTGCAAGG GTGGCAAGAT CGGCCTGTTT 4 00 

GGCGGCGCCG GTGTGGGCAA GACCGTGATC ATCATGGAGC TGATTAACAA 450 

CGTCGCGAAG GGCCACGGTG GTTTCTCGGT GTTTGCCGGC GTTGGCGAGC 500 

60 GCACGCGCGA GGGCACGGAC CTGTACCTGG AGATGATGCA GTCGAAGGTG 550 

482 
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ATTGACCTGA AGGGCGAGTC GAAGTGCGTG 

CGAGCCCCCG GGTGCGCGCG CGCGCGTTGC 

CGGAGTACTT CCGAGACGTG GAGGGCCAGA 

AACATCTTCC GCTTCACGCA GGCGAACTCC 

5 CCGCATTCCG GCCGCCGTGG GCTACCAGCC 
GTATGCTGCA GGAGCGCATC ACGTCGACGA 

GTGCAGGCCG TGTACGTGCC TGCGGATGAT 

GACGACGTTC TCGCACCTGG ACGCGACGAC 

CGGAGTCGGG GATCTACCCT GCCGTGAACC 

10 ATCATGGACC CCGATGTGAT CGACGTGGAC 

TATCGTGCAG ATGCTGACCA AGTACAAGGA 

TGCTTGGTAT CGACGAGCTG AGCGAGGAGG 

GCGCGCAAGG TGACCCGGTT CCTGTCGCAG 

GTTCACGGGC ATGACGGGCC AC 

15 
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CTTGTGTACG GGCAGAT&AA " 600 

GCAGTCTGCG CTGACGATGG 650 

ATGTGCTGCT GTTCATCGAC 700 

GAGGTCTCTG CGCTGCTGGG 750 

GACGCTTGCG GAGGATCTTG 8 00 

CGAAGGGGTC GATCACGTCC 850 

ATCACGGATC CGGCGCCCGC 900 

TGTGCTGGAC CGCGCGGTGG 950 

CGCTGGAGTG CGCGTCGCGT 1000 

CACTACAACG TTGCGCAGGA 1050 

GCTGCAGGAT ATCATTGCGG 110 0 

ACAAGGTCGT GGTGGACCGC 1150 

CCGTTCCAGG TTGCGGAGGT 1200 

1222 



2) INFORMATION FOR SEQ ID NO: 861 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1246 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 
30 (B) STRAIN: ATCC 50129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 861 

CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG CGATGAGCCG 50 

35 CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAAC AC CG GCCGCTGCAT 100 

TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG GTTGTGTCGA 150 

CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT GGGCCGCATC 2 00 

TTCAACGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG TGGGCGAGAA 2 50 

GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG GATCAGGCCG 3 00 

40 CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA CCTGATTCTG 3 50 

CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG CCGGTGTGGG 400 

CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG AAGGGCCACG 45 0 

GTGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG CGAGGGCACG 500 

GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC TGAAGGGCGA 550 

45 GTCGAAGTGC GTGCTTGTGT ACGGGCAGAT GAACGAGCCC CCGGGTGCGC 600 

GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA CTTCCGCGAC 650 

GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAACATCT TCCGCTTCAC 700 

GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT CCGGCCGCCG 750 

TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT GCAGGAGCGC 8 00 

50 ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG CCGTGTACGT 850 

GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG TTCTCGCACC 900 

TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC GGGCATCTAC 950 

CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG ACCCCGATGT 1000 

GATCGATGTG GACCACTACA ACGTTGCGCA GGATATCGTG CAGATGCTGA 1050 

55 CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG CATCGACGAG 1100 

CTGAGCGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA AGGTGACCCG 1150 

GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG GGCATGACGG 12 00 

GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC TGGCCT 124 6 
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2) INFORMATION FOR SEQ ID NO: 862 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 65 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 30815 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 862 

GGCGTGCCGC CCGTGCTGAC GGCGCTGGAT GTGACGGAGG ACCTTGGCCG 50 

CGATGAGCCG CTGACGCTGG AGATCGTGCA GCACTTGGAC GCGAACACGG 100 

GCCGCTGCAT TGCGATGCAG ACGACGGACC TGCTGAAGCT GAAGTCGAAG 150 

GTCGTGTCGA CCGGCGGCAA CATCTCTGTG CCGGTGGGCC GTGAGACGCT 2 00 

GGGCCGCATC TTCAAYGTTC TGGGCGACGC GATCGACCAG CGCGGCCCCG 2 50 

TGGGCGAGAA GATGCGCATG GCGATCCACG CCGAGGCCCC GAAGCTGGCG 300 

GATCAGGCCG CGGAGGACAC GATCCTGACG ACCGGCATCA AGGTGATCGA 350 

CCTGATTCTG CCCTACTGCA AGGGTGGCAA GATCGGCCTG TTCGGCGGTG 4 00 

CCGGTGTGGG CAAGACTGTG ATCATCATGG AGCTGATCAA CAACGTCGCG 450 

AAGGGCCACG GCGGTTTCTC CGTGTTTGCC GGCGTTGGCG AGCGCACGCG 500 

CGAGGGCACG GACCTGTACC TGGAGATGAT GCAGTCGAAG GTGATTGACC 550 

TGAAGGGCGA GTCGAAGTGY GTGCTTGTGT ATGGGCAGAT GAACGAGCCC 600 

CCGGGTGCGC GCGCGCGCGT TGCGCAGTCT GCGCTGACGA TGGCGGAGTA 650 

CTTCCGCGAC GTGGAGGGCC AGAACGTGCT GCTGTTCATC GACAACATCT 700 

TCCGCTTCAC GCAGGCGAAC TCCGAGGTGT CTGCGCTGCT GGGCCGCATT 750 

CCGGCCGCCG TGGGCTACCA GCCGACGCTT GCGGAGGATC TTGGTATGCT 80 0 

GCAGGAGCGC ATCACGTCGA CAACGAAGGG GTCGATCACG TCCGTGCAGG 850 

CCGTGTACGT GCCAGCGGAT GATATCACGG ATCCCGCGCC CGCGACGACG 900 

TTCTCGCACC TGGACGCGAC GACTGTGCTG GACCGCGCGG TGGCGGAGTC 950 

GGGCATCTAC CCTGCCGTGA ACCCGCTGGA GTGCGCGTCG CGTATCATGG 1000 

ACCCTGATGT GATCGATGTG GACCACTACA ACGTTGCGCA GGATATCGTG 1050 

CAGATGCTGA CCAAGTACAA GGAGCTGCAG GATATCATTG CGGTGCTTGG 1100 

CATCGACGAG CTGAGCGAGG AAGACAAGGT TGTTGTGGAC CGCGCGCGCA 1150 

AGGTGACCCG GTTCCTGTCG CAGCCGTTCC AGGTTGCGGA GGTGTTCACG 12 0 0 

GGCATGACGG GCCACTACGT GCAGCTGGTC GACACGGTGG AGTCGTTCTC 1250 

TGGCCTGCTG ATGGG 12 65 



2) INFORMATION FOR SEQ ID NO: 863 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1191 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 863 

484 
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ATGGCAAAAG 
GATTGGTCAC 
CGGTGCTGTC 
5 GACGCGGCGC 
CGTTGAGTAC 
GTCACGCTGA 
GGCGCGATCC 
CGAGCACATT 

10 TCCTGAACAA 
GAGATGGAAG 
CACGCCGATC 
GCGACCTGGG 
TACATTCCGA 

15 GGAAGACGTG 
TCGAGCGCGG 
AAGCCGACGG 
GCTGGACCAG 
CCAAGCGTGA 

2 0 ATCAACCCGC 
AGAGGGTGGC 
ACTTCCGCAC 
GAAATGGTGC 
CCCGATCGCC 

2 5 GTACCGTCGG 



GCAAGTTTGA 
GTTGACCACG 
GAACAAGTTC 
CGGAAGAGAA 
GAGACGGAGA 
CTACGTGAAG 
TGGTGGTGTC 
TTGCTGTCGC 
GGCGGACATG 
TCCGCGAACT 
GTGAATGGTG 
CGAGCAGGCG 
CGCCGGAGCG 
TTCTCGATCT 
CGTGGTGAAG 
TGAAGACGAC 
GGCCAGGCGG 
AGACGTCGAG 
ACACGGACTT 
CGTCACACGC 
GACGGACGTG 
TGCCGGGCGA 
ATGGAAGAAG 
TGCCGGCGTC 



ACGTACCAAG 
GCAAAACGAC 
GGCGGCGAGG 
GGCGCGTGGG 
CGCGTCACTA 
AACATGATCA 
GGCCGCAGAC 
GCCAGGTTGG 
GTTGATGACG 
GCTGAGCAAG 
CGGCCAAGCT 
ATTCTGTCGC 
CGCGGTCGAC 
CGGGCCGTGG 
GTTGGCGAGG 
CTGCACGGGC 
GCGACAACGT 
CGTGGCCAGG 
CACGGCCGAG 
CGTTCTTCAA 
ACCGGCACGA 
CAACGTGTCG 
GTCTGCGTTT 
GTCGCCAAGA 



CCGCACGTGA 
GTTGACGGCG 
CTCGCGGCTA 
ATCACGATCA 
CGCGCACGTT 
CGGGTGCTGC 
GGCCCGATGC 
CGTGCCGTAC 
CGGAGCTGCT 
TACGATTTCC 
GGCGCTGGAA 
TGGCGCAAGC 
GGTGCGTTCC 
CACGGTGGTG 
AAATCGAAAT 
GTGGAGATGT 
GGGTATCTTG 
TGCTGGCCAA 
GTGTACATTC 
CGGCTATCGT 
TCGACCTGCC 
ATGACCGTCA 
CGCCATCCGT 
TCATCAAGTA 



ACGTGGGTAC 50 

GCGATCACGA 100 

CGACCAGATT 150 

ACACCTCGCA 2 00 

GATTGCCCGG 2 50 

GCAGATGGAC 300 

CGCAGACGCG 350 

ATCATCGTGT 4 00 

CGAGCTGGTG 450 

CGGGCGATGA 50 0 

AGCGACAACG 550 

GCTGGACACG 600 

TGATGCCGGT 650 

ACTGGCCGTA 700 

CGTGGGCATC 750 

TCCGCAAGCT 800 

CTGCGCGGCA 850 

GCCGGGTTCG 900 

TGTCCAAGGA 950 

CCGCAGTTCT 1000 

GGCGGACAAG 105 0 

AGCTGCTGGC 110 0 

GAAGGCGGTC 115 0 

A 1191 



2) INFORMATION FOR SEQ ID NO: 864 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

4 0 (A) ORGANISM: Trypanosoma brucei 

(B) STRAIN: LVH/75/USAMRU-K/18 

(C) ACCESSION NUMBER: extracted from U10562 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 864 



ATGGGAAAGG AAAAGGTGCA CATGAATCTT GTGGTGGTGG GCCACGTCGA 50 

TGCCGGTAAA TCCACTGCAA CGGGTCACTT GATCTACAAG TGCGGTGGTA 10 0 

TTGACAAACG TACGATCGAG AAGTTCGAGA AAGAAGCTGC CGACATTGGT 150 

AAGGCCTCAT TCAAGTACGC ATGGGTGCTG GACAAGCTGA AGGCTGAGCG 2 00 

50 CGAACGTGGT ATCACGATCG ACATTGCACT GTGGAAATTC GAGTCACCCA 250 

AGTCTGTCTT CACTATTATT GATGCTCCTG GGCACCGTGA CTTCATCAAG 300 

AACATGATCA CCGGCACATC GCAAGCCGAC GCAGCCATCC TCATCATTGC 350 

CTCTGCGCAG GGTGAGTTCG AGGCTGGTAT CTCCAAGGAT GGACAGACCC 400 

GCGAGCACGC GTTGCTGGCC TTCACTTTGG GTGTGAAGCA GATGGTTGTG 450 

55 TGCTGCAACA AAATGGACGA CAAGACTGTG AACTACGGAC AGGAGCGGTA 500 

TGACGAGATT GTGAAGGAGG TGTCTGCTTA CATCAAGAAG GTTGGGTACA 550 

ACGTGGAGAA GGTGCGCTTC GTCCCCATCT CCGGATGGCA GGGCGACAAC 600 

ATGATTGAGA AATCCGAGAA GATGCCATGG TACAAGGGTC CAACGCTCCT 650 

GGAGGCACTA GACATGCTGG AGCCACCAGT GCGTCCGAGC GACAAGCCCC 700 

60 TGCGTCTGCC ACTGCAGACG TGTACAAAGA TCGGTGGTAT TGGCACCGTG 750 

485 
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CCCGTTGGTC 
GTTTGCCCCC 
ACGAGCAGCT 
AAGAACGTTT 
CAAGAACGAC 
TCCTGAACCA 
TGCCACACAT 
CGACCGTCGC 
CTGGCGACGC 
GAGGTCTTCA 
GCGCCAGACC 
GTTCTGGTGG 



GTGTGGAGAC 
GCCAACGTGA 
CGCTGAGGCG 
CTGTAAAGGA 
CCCCCAAAGG 
CCCCGGACAG 
CGCACATTGC 
TCTGGCAAGG 
CGCGATCGTG 
ACGACTACGC 
GTCGCTGTCG 
TAAGGTGACG 



CGGCGTGATG 
CGACCGAGGT 
ACCCCCGGTG 
CATCCGCCGT 
AGGCCGCCGA 
ATTGGAAACG 
CTGCAAGTTC 
AGCTGGAGAA 
CGCATGGTGC 
GCCACTCGGC 
GTATCATCAA 
AAGGCTGCGG 



AAGCCTGGTG 
GAAATCGATC 
ACAACGTCGG 
GGCAACGTCT 
CTTCACGGCA 
GTTATGCGCC 
GCGGAGATCG 
GGCTCCCAAG 
CGCAGAAGCC 
CGCTTTGCCG 
GGCCGTGACC 
TGAAGGCTTC 



PCT/CA00/01150 

ATGTGGTGAC 800, 

GAGATGC AC C 8 50 

CTTTAACGTG 900 

GCGGTAACAC 950 

CAGGTGATCA 1000 

CGTGCTGGAC 1050 

AGTCGAAGAT 1100 

TCGATCAAGT 1150 

TATGTGCGTG 12 00 

TGCGTGACAT 12 50 

AAGAAGGACG 13 00 

GAAGAAATAA 13 50 



15 



2) INFORMATION FOR SEQ ID NO: 865 



20 



25 



30 



35 



40 



45 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptosporidium parvum 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 865 



AAGCTCAAGG 
GCAATTCGAA 
ACAGAGATTT 
GCTTTATTGG 
AGGTCAAACC 
AAATGATTGT 
CGTTTTGATG 
TTACAACACC 
ATAATATGGT 
TTAGTCGAAG 
GCCACTCCGT 
CTGTCCCAGT 
GTTACCTTCG 
GCACCATGAG 
ATGTTAAGAA 
GATGCCAAGA 
TATCGTCCTC 
TTGACTGTCA 
AAGATGGACA 
CAAGTCTGGT 
GTGTTGAGGC 
GA 



CTGAGAGAGA 
ACCCCAAAAT 
CATCAAGAAT 
TTGTCCCAGC 
AGAGAACATG 
CGGTATTAAC 
AAATCTTCAA 
GAGAAGATCC 
TGAGAGATCT 
CCCTCGACAC 
CTCCCATTAC 
CGGTCGTGTT 
CTCCAGCTGG 
CAGATGCCAG 
CGTCTCCATC 
ATGACCCAGC 
AACCACCCAG 
CACCGCTCAC 
AGAGATCTGG 
GATGCTGCTT 
CTTCACTGAC 



AAGAGGTATT 
ACCACTACAC 
ATGATTACTG 
CGATCGTTTC 
CTTTATTGGC 
AAGATGGATA 
CGAAGTTGAT 
CATTCGTTGC 
GACAAGATGC 
TATGGAACCA 
AAGATGTTTA 
GAGACTGGTA 
TGTTACCACT 
AGGCCGTCCC 
AAGG AT AT C A 
TAAGGGCTGT 
GTGAAATCAA 
ATTTCCTGCA 
TAAGGTTTTG 
TGGTTGTTAT 
TACCCACCTC 



ACCATCGATA 
TGTCATTGAT 
GTACCTCTCA 
GAAGGTGCCT 
CTTCACTTTG 
CCTGTGAATA 
GGTTACCTCA 
CATTTCTGGT 
CATGGTATAA 
CCAAAGAGAC 
CAAGATAGGT 
TCATCAGACC 
GAAGTTAAGT 
AGGTGACAAC 
AGAGAGGTTT 
GAAGACTTCA 
GAACGGTTAC 
AATTCCAGAC 
GAAGAAAACC 
GCAACCTTTG 
TAGGTCGTTT 



TTGCTTTATG 50 

GCCCCAGGTC 100 

AGCTGATGTT 150 

TCTCCAAGGA 2 00 

GGTGTCAGAC 2 50 

CAAGCAATCT 3 00 

AGAAGGTTGG 3 50 

TTCGTTGGTG 4 00 

GGGTAAGACC 4 50 

CAACTGACAA 5 00 

GGTGTAGGTA 550 

AGGTATGAAT 600 

CAGTAGAAAT 650 

GTTGGTTTCA 700 

CGTTGCTTCT 750 

CTGCTCAAGT 8 00 

TCTCCAGTCG 850 

TATCACTGCT 900 

CAAAGCTTAT 950 

AAGCCACTTT 10 00 

CGCTGTCCGT 1050 
1052 



55 



60 



2) INFORMATION FOR SEQ ID NO: 866 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 7 bases 

(B) TYPE: Nucleic acid 

486 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 35552 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 866 

CAATGAAGTT CCAGAAATTA ACAATGCCTT AGTCGTAGAC GTTGAAAGAG 50 

ATGAAGGTAC AGTATCTCTT ACATTAGAAG TGGCATTACA ACTTGGCGAT 100 

GATGTCGTAC GTACAATTGC AATGGATTCT ACTGATGGTG TTAAACGTGG 150 

15 TACAGAAGTT CGAGATAGCG GAGATAGCAT CAGTGTTCCA GTTGGTGATG 2 00 

CTACGTTAGG ACGTGTGTTT AATGTTCTTG GTGATACAAT TGACTTAGAC 2 50 

GAGAAGCTTG ATACTTCTGT CAAACGTGAT CCAATTCATA GAGAAGCAC C 3 00 

TGCATTCGAT CAATTATCAA CAAAAGTTGA AATCTTAGAA ACAGGTATTA 350 

AAGTAATTGA TTTACTTGCA CCATATATTA AAGGTGGTAA AATCGGTTTA 4 00 

2 0 TTCGGTGGCG CTGGTGTAGG TAAAACAGTA TTAATTCAAG AATTAATTAA 450 

TAATATAGCT CAAGAACATG GTGGTATTTC AGTATTTGCC GGCGTAGGTG 500 

AACGTACGCG TGAAGGTAAT GACTTATACT ACGAAATGAG TGATAGTGGT 550 

GTTATTAAGA AAACAGCTAT GGTCTTCGGA CAAATGAATG AGCCACCTGG 600 

TGCGCGTATG CGTGTTGCTT TATCAGGCTT AACAATGGCT GAACACTTCC 650 

2 5 GTGATGTACA AGGACAAGAT GTTTTACTAT TTATTGATAA CATATTCAGA 7 00 

TTTACGCAAG CTGGTTCAGA AGTATCAGCA CTATTAGGTC GTATGCCATC 750 

AGCCGTTGGT TATCAACCTA CCCTTGCTAC TGAAATGGGT CAATTACAAG 800 

AACGTATTAC ATCAACAACT AAAGGATCTG TAACGTC 837 



30 



45 



2) INFORMATION FOR SEQ ID NO: 867 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 867 



AAGGTATTCG ATGCCTTGAA AATGGAAGGC TCCGAGCTGA CCCTGGAAGT 50 

ACAACAGCAG CTGGGCGACG GCATTGTCCG TACCATTGCA CTGGGTACCT 100 

50 CCGACGGCCT GCGTCGCGGC ATGATGATCC AGAACACCGG CAAACCTATC 150 

ATGGTGCCAG TCGGTAAAGC AACCCTGGGT CGCATCATGG ACGTGCTGGG 2 00 

TAACCCGATC GACGAATGCG GCGCGGTCGC TCACGACCAG ATCGCTTCGA 2 50 

TCCACCGCGC TCCTCCTGCG TACGACGAAC TGTCGCCATC GCAAGATCTG 3 00 

CTGGAAACCG GCATTAAAGT TATTGACCTG GTGTGCCCGT TCGCCAAGGG 3 50 

55 CGGTAAAGTC GGTCTGTTCG GCGGTGCAGG TGTGGGCAAG ACCGTGAACA 4 00 

TGATGGAACT GATCAACAAC ATCGCCAAAG CACACTCGGG TCTGTCCGTG 4 50 

TTTGCCGGTG TGGGTGAGCG TACCCGTGAA GGTAACGACT TCTACCACGA 500 

GATGGCTGAC GCCAAAGTGG TCGATCTGGA AAATCCAGAG AACTCCAAGG 550 

TTGCGATGGT CTACGGTCAG ATGAATGAAC CACCAGGCAA CCGTCTGCGC 600 

60 GTGGCGCTGA CCGGTCTGAC CATGGCTGAA GCATTCCGTG ACGAAGGCAA 650 
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AGACGTTCTG TTCTTCGTGG 
CCGAAGTATC GGCACTGCTG 
CCTACGCTGG CCGAAGAAAT 
CAAGACCGGT TCGATCAC 



ACAACATCTA CCTgCTTCACC 
GGCCGTATGC CATCGGCTGT 
GGGTCGCCTG CAAGAGCGCA 



PCT/CAOO/01150 

CTGGCCG(?£A~ * 700~ 
GGGTTACCAG 750 
TCACTTCGAC 800 

818 



2) INFORMATION FOR SEQ ID NO: 868 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 43867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 868 

CTATCTTAGT AGTATCTGCT GCTGATGGCC CAATGCCACA AACTCGTGAA 50 

CACATTCTTT TATCACGTAA CGTTGGTGTT CCAGCATTAG TTGTATTCTT 10 0 

AAACAAAGTT GACATGGTTG ACGATGAAGA ATTATTAGAA TTAGTAGAAA 150 

TGGAAGTTCG TGACTTATTA AGCGAATATG ACTTCCCAGG TGACGATGTA 200 

CCTGTAATCT CTGGTTCTGC ATTAAAAGCT TTAGAAGGCG ACGCTGACTA 250 

TGAGCAAAAA ATCTTAGACT TAATGCAAGC TGTTGATGAC TTCATTCCAA 300 

CACCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT TGAGGACGTA 3 50 

TTCTCAATCA CTGGTCGTGG TACTGTTGCT ACAGGCCGTG TTGAACGTGG 4 00 

TCAAATCAAA GTCGGTGAAG AAATCGAAAT CATCGGTATG CAAGAAGAAT 450 

CAAGCAAAAC AACTGTTACT GGTGTAGAAA TGTTCCGTAA ATTATTAGAC 50 0 

TACGCTGAAG CTGGTGACAA CATTGGTGCA TTATTACGTG GTGTTTCACG 550 

TGATGACGTA CAACGTGGTC AAGTTTTAGC TGCTCCTGGT ACTATTACAC 600 

CACATACAAA ATTCAAAGCG GATGTTTACG TTTTATCTAA AGATGAAGGT 650 

GGTCGTCATA CACCATTCTT CACTAACTAC CGCCCACAAT TCTATTTCCG 700 

TACTACTGAC GTAACTGGTG TTGTTAACTT ACCAGAAGGT ACTGAAATGG 750 

TTATGCCTGG CGATAACGTT GAAATGGA 778 



2) INFORMATION FOR SEQ ID NO: 869 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 69 

TGGTCCTATG CCTCAAACAC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 
GTGTACCATA CATCGTTGTT TTCTTAAACA AAATGGATAT GGTTGATGAC 10 0 
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GAAGAATTAC TAGAATTAGT TGAAATGGAA GTTCGTGACT TATTGT&AGA 150 

ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCTGGT TCTGCTTTGA 200 

AAGCTCTTGA AGGCGATGCT TCATACGAAG AAAAAATCAT GGAATTAATG 250 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGACA CTGACAAACC 300 

ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 350 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGACGAAGTT 4 00 

GAAATCGTTG GTATTGCTGA AGAAACTGCT AAAACAACTG TAACTGGTGT 450 

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGTGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTTA AAGCTGAAGT 600 

TTACGTTTTA ACAAAAGAAG AAGGTGGACG TCACACTCCA 64 0 



2 ) INFORMATION FOR SEQ ID NO : 8 70 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 644 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 

(B) STRAIN: R754 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 870 

GTCCTATGCC TCAAACACGT GAACACATCT TGTTATCACG TAACGTTGGT 50 

GTACCATACA TCGTTGTTTT CTTAAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTACTA GAATTAGTTG AAATGGAAGT TCGTGACTTA TTGTCAGAAT 150 

ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCTGGTTC TGCTTTGAAA 2 00 

GCTCTTGAAG GCGATGCTTC ATACGAAGAA AAAATCATGG AATTAATGGC 250 

TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGACACT GACAAACCAT 3 00 

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 3 50 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ACGAAGTTGA 4 00 

AATCGTTGGT ATTGCTGAAG AAACTGCTAA AACAACTGTA ACTGGTGTTG 4 50 

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500 

GCATTGCTAC GTGGTGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTTAAA GCTGAAGTTT 600 

ACGTTTTAAC AAAAGAAGAA GGTGGACGTC ACACACCATT CTTC 644 



2) INFORMATION FOR SEQ ID NO : 871 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 637 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R758 
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(xi) SEQUENCE DESCRIPTION: SEQ 

TCCTATGCCT CAAACACGTG AACACATCTT 
TACCATACAT CGTTGTTTTC TTAAACAAAA 
5 GAATTACTAG AATTAGTTGA AATGGAAGTT 
TGACTTCCCA GGCGACGATG TTCCTGTAAT 
CTCTTGAAGG CGATGCTTCA TACGAAGAAA 
GCAGTTGACG AATACGTTCC AACTCCAGAA 
CATGATGCCA GTCGAAGACG TATTCTCAAT 

10 CTACAGGCCG TGTTGAACGT GGACAAGTTC 
ATCGTTGGTA TTGCTGAAGA AACTGCTAAA 
AATGTTCCGT AAATTGTTAG ACTATGCTGA 
CATTGCTACG TGGGGTTGCT CGTGAAGACA 
GCTAAAGCTG GTACAATCAC ACCTCATACA 

15 CGTTTTAACA AAAGAAGAAG GTGGACGTCA 



PCT/CA00/01150 

ID NO: 871 

GTTATCACGT AACGTTGGTG 50 

TGGATATGGT TGATGACGAA 100 

CGTGACTTAT TGTCAGAATA 150 

CGCTGGTTCT GCTTTGAAAG 2 00 

AAATCATGGA ATTAATGGCT 2 50 

CGTGACACTG ACAAACCATT 3 00 

CACTGGACGT GGTACTGTTG 350 

GCGTTGGTGA CGAAGTTGAA 400 

ACAACTGTAA CTGGTGTTGA 4 50 

AGCAGGGGAT AACATTGGTG 500 

TCCAACGTGG ACAAGTATTA 550 

AAATTTAAAG CTGAAGTTTA 600 

CACTCCA 637 



2) INFORMATION FOR SEQ ID NO: 872 



20 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 643 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R631 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 872 

35 GTCCTATGCC TCAAACTCGT GAACACATCT TGTTATCACG TAACGTTGGC 50 

GTACCATACA TCGTTGTTTT CTTGAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTGCTA GAATTAGTTG AAATGGAAGT TCGTGACCTA TTGTCTGAAT 150 

ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCCGGTTC TGCTTTGAAA 2 00 

GCTCTTGAAG GAGATCCTTC ATACGAAGAA AAAATCATGG AATTGATGGC 250 

4 0 TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGATACT GACAAACCAT 300 

TCATGATGCC AGTCGAAGAC GTATTCTCAA TCACTGGACG TGGTACTGTT 3 50 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ATGAAGTAGA 4 00 

AATCGTTGGT ATTGCTGACG AAACTGCTAA AACAACTGTA ACAGGTGTTG 45 0 

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 50 0 

45 GCATTGCTAC GTGGGGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTCAAA GCTGAAGTTT 60 0 

ATGTTTTGAC AAAAGAAGAA GGTGGACGTC ACACTCCATT CTT 64 3 



50 



60 



2) INFORMATION FOR SEQ ID NO: 873 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 641 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R691 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 873 

GTCCTATGCC TCAAACTCGT GAACACATCT TGTTATCACG TAACGTTGGC 50 

GTACCATACA TCGTTGTTTT CTTGAACAAA ATGGATATGG TTGATGACGA 100 

AGAATTGCTA GAATTAGTTG AAATGGAAGT TCGTGACCTA TTGTCTGAAT 150 

10 ATGACTTCCC AGGCGACGAT GTTCCTGTAA TCGCCGGTTC TGCTTTGAAA 2 00 

GCTCTTGAAG GAGATCCTTC ATACGAAGAA AAAATCATGG AATTGATGGC 250 

TGCAGTTGAC GAATACGTTC CAACTCCAGA ACGTGATACT GACAAACCAT 3 00 

TCATGATGCC AGTCGAAGAC GTATXCTCAA TCACTGGACG TGGTACTGTT 3 50 

GCTACAGGCC GTGTTGAACG TGGACAAGTT CGCGTTGGTG ATGAAGTAGA 4 00 

15 AATCGTTGGT ATTGCTGACG AAACTGCTAA AACAACTGTA ACAGGTGTTG 450 

AAATGTTCCG TAAATTGTTA GACTATGCTG AAGCAGGGGA TAACATTGGT 500 

GCATTGCTAC GTGGGGTTGC TCGTGAAGAC ATCCAACGTG GACAAGTATT 550 

GGCTAAAGCT GGTACAATCA CACCTCATAC AAAATTCAAA GCTGAAGTTT 600 

ATGTTTTGAC AAAAGAAGAA GGTGGACGTC ACACTCCATT C 641 

20 

2) INFORMATION FOR SEQ ID NO: 874 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 681 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



55 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
35 (B) STRAIN: LSPQ 2514 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 874 

ACCAGCATTA GTAGTATTCT TAAATAAAGT TGACATGGTT GACGATGAAG 50 

4 0 AATTATTAGA ATTAGTTGAA ATGGAAGTAC GTGACTTATT ATCTGAATAC 100 

GACTTCCCAG GTGACGATGT ACCTGTAATC GCTGGTTCAG CATTAAAAGC 150 

TTTAGAAGGC GATGCTCAAT ACGAAGAAAA AATCTTAGAA TTAATGCAAG 2 00 

CAGTTGATGA CTACATTCCA ACTCCAGAAC GTGATTCTGA CAAACCATTC 2 50 

ATGATGCCAG TTGAGGACGT ATTCTCAATC ACTGGTCGTG GTACTGTTGC 300 

45 TACAGGCCGT GTTGAACGTG GGCAAATCAA AGTTGGTGAA GAAGTTGAAA 3 50 

TCATTGGTAT CCATGACACT TCTAAAACAA CTGTTACTGG TGTAGAAATG 4 00 

TTCCGTAAAT TATTAGACTA CGCTGAAGCT GGTGACAACA TCGGTGCATT 450 

ATTACGTGGT GTTGCTCGTG AAGACGTACA ACGTGGTCAA GTATTAGCTG 500 

CTCCAGGTTC AATCACACCT CACACAAAAT TTAAAGCAGA CGTATACGTT 550 

50 TTATCTAAAG ACGAAGGTGG ACGTCACACT CCATTCTTCA CAAACTATCG 600 

TCCACAATTC TATTTCCGTA CTACTGACGT AACTGGTGTT GTTAACTTAC 650 

CAGAAGGTAC TGAAATGGTT ATGCCTGGCG A 681 



2) INFORMATION FOR SEQ ID NO: 875 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 675 bases 
60 (B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epideimtidis 

(B) STRAIN: R591 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 875 

ATTATCACGT AACGTTGGTG TACCAGCATT AGTTGTATTC TTAAACAAAG 50 

TTGACATGGT AGACGACGAA GAATTATTAG AATTAGTTGA AATGGAAGTT 100 

CGTGACTTAT TAAGCGAATA TGACTTCCCA GGTGACGATG TACCTGTAAT 150 

15 CGCTGGTTCT GCATTAAAAG CATTAGAAGG CGATGCTGAA TACGAACAAA 2 00 

AAATCTTAGA CTTAATGCAA GCAGTTGATG ATTACATTCC AACTCCAGAA 2 50 

CGTGATTCTG ACAAACCATT CATGATGCCA GTTGAGGACG TATTCTCAAT 3 00 

CACTGGTCGT GGTACTGTTG CTACAGGCCG TGTTGAACGT GGTCAAATCA 350 

AAGTTGGTGA AGAAGTTGAA ATCATCGGTA TGCACGAAAC TTCTAAAACA 4 00 

2 0 ACTGTTACTG GTGTAGAAAT GTTCCGTAAA TTATTAGACT ACGCTGAAGC 4 50 

TGGTGACAAC ATCGGTGCTT TATTACGTGG TGTTGCACGT GAAGACGTAC 500 

AACGTGGTCA AGTATTAGCT GCTCCTGGTT CTATTACACC ACACACAAAA 550 

TTCAAAGCTG AAGTATACGT ATTATCTAAA GATGAAGGTG GACGTCACAC 600 

TCCATTCTTC ACTAACTATC GCCCACAATT CTATTTCCGT ACTACTGACG 650 

2 5 TAACTGGTGT TGTAAACTTA CCAGA 675 



2) INFORMATION FOR SEQ ID NO: 876 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 704 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Staphylococcus epideirmidis 

(B) STRAIN: CSG 10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 876 

4 5 TCTTATTATC ACGTAACGTT GGTGTACCAG CATTAGTTGT ATTCTTAAAC 50 

AAAGTTGACA TGGTAGACGA CGAAGAATTA TTAGAATTAG TTGAAATGGA 100 

AGTTCGTGAC TTATTAAGCG AATATGACTT CCCAGGTGAC GATGTACCTG 150 

TAATCGCTGG TTCTGCATTA AAAGCATTAG AAGGCGATGC TGAATACGAA 2 00 

CAAAAAATCT TAGACTTAAT GCAAGCAGTT GATGATTACA TTCCAACTCC 2 50 

50 AGAACGTGAT TCTGACAAAC CATTCATGAT GCCAGTTGAG GACGTATTCT 3 00 

CAATCACTGG TCGTGGTACT GTTGCTACAG GCCGTGTTGA ACGTGGTCAA 3 50 

ATCAAAGTTG GTGAAGAAGT TGAAATCATC GGTATGCACG AAACTTCTAA 4 00 

AACAACTGTT ACTGGTGTAG AAATGTTCCG TAAATTATTA GACTACGCTG 4 50 

AAGCTGGTGA CAACATCGGT GCTTTATTAC GTGGTGTTGC ACGTGAAGAC 500 

55 GTACAACGTG GTCAAGTATT AGCTGCTCCT GGTTCTATTA CACCACACAC 550 

AAAATTCAAA GCTGAAGTAT ACGTATTATC TAAAGATGAA GGTGGACGTC 600 

ACACTCCATT CTTCACTAAC TATCGCCCAC AATTCTATTT CCGTACTACT 650 

GACGTAACTG GTGTTGTAAA CTTACCAGAA GGTACAGAAA TGGTTATGCC 700 

TGGC 704 
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2) INFORMATION FOR SEQ ID NO: 877 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 770 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococ 

(B) STRAIN: ATCC 35984 

(xi) SEQUENCE DESCRIPTION: SEQ 

TCTTAGTTGT ATCTGCTGCT GACGGTCCAA 
ATCTTATTAT CACGTAACGT TGGTGTACCA 
CAAAGTTGAC ATGGTAGACG ACGAAGAATT 
AAGTTCGTGA CTTATTAAGC GAATATGACT 
GTAATCGCTG GTTCTGCATT AAAAGCATTA 
ACAAAAAATC TTAGACTTAA TGCAAGCAGT 
CAGAACGTGA TTCTGACAAA CCATTCATGA 
TCAATCACTG GTCGTGGTAC TGTTGCTACA 
AATCAAAGTT GGTGAAGAAG TTGAAATCAT 
AAACAACTGT TACTGGTGTA GAAATGTTCC 
GAAGCTGGTG ACAACATCGG TGCTTTATTA 
CGTACAACGT GGTCAAGTAT TAGCTGCTCC 
CAAAATTCAA AGCTGAAGTA TACGTATTAT 
CACACTCCAT TCTTCACTAA CTATCGCCCA 
TGACGTAACT GGTGTTGTAA ACTTACCAGA 
CTGGCGACAA CGTTGAAATG 



s epidermidis 



ID NO: 877 

TGCCACAAAC TCGTGAACAC 50 

GCATTAGTTG TATTCTTAAA 100 

ATTAGAATTA GTTGAAATGG 150 

TCCCAGGTGA CGATGTACCT 2 00 

GAAGGCGATG CTGAATACGA 2 50 

TGATGATTAC ATTCCAACTC 3 00 

TGCCAGTTGA GGACGTATTC 3 50 

GGCCGTGTTG AACGTGGTCA 4 00 

CGGTATGCAC GAAACTTCTA 450 

GTAAATTATT AGACTACGCT 500 

CGTGGTGTTG CACGTGAAGA 550 

TGGTTCTATT ACACCACACA 600 

CTAAAGATGA AGGTGGACGT 650 

CAATTCTATT TCCGTACTAC 700 

AGGTACAGAA ATGGTTATGC 750 

770 



2) INFORMATION FOR SEQ ID NO: 878 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 716 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 35983 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 878 

TTGTATTCTT AAACAAAGTT GACATGGTAG ACGACGAAGA ATTATTAGAA 50 

TTAGTTGAAA TGGAAGTTCG TGACTTATTA AGCGAATATG ACTTCCCAGG 100 

TGACGATGTA CCTGTAATCG CTGGTTCTGC ATTAAAAGCA TTAGAAGGCG 150 

ATGCTGAATA CGAACAAAAA ATCTTAGACT TAATGCAAGC AGTTGATGAT 2 00 

TACATTCCAA CTCCAGAACG TGATTCTGAC AAACCATTCA TGATGCCAGT 2 50 

TGAGGACGTA TTCTCAATCA CTGGTCGTGG TACTGTTGCT ACAGGCCGTG 3 00 

TTGAACGTGG TCAAATCAAA GTTGGTGAAG AAGTTGAAAT CATCGGTATG 350 

493 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CACGAAACTT CTAAAACAAC TGTTACTGGT 
ATTAGACTAC GCTGAAGCTG GTGACAACAT 
TTGCACGTGA AGACGTACAA CGTGGTCAAG 
ATTACACCAC ACACAAAATT CAAAGCTGAA 
TGAAGGTGGA CGTCACACTC CATTCTTCAC 
ATTTCCGTAC TACTGACGTA ACTGGTGTTG 
GAAATGGTTA TGCCTGGCGA CAACGTTGAA 
TCCAATCGCT ATCGAA 



PCT/CAOO/01150 

GTAGAAATGT TCCGTAAATT 4 00 

CGGTGCTTTA TTACGTGGTG 4 50 

TATTAGCTGC TCCTGGTTCT 50 0 

GTATACGTAT TATCTAAAGA 550 

TAACTATCGC CCACAATTCT 600 

TAAACTTACC AGAAGGTACA 650 

ATGACAGTTG AATTAATCGC 700 

716 



2) INFORMATION FOR SEQ ID NO: 879 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R764 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 879 

CGGTCCTATG CCTCAAACTC GTGAACACAT CTTGTTATCA CGTAACGTTG 50 

GCGTACCATA CATCGTTGTT TTCTTGAACA AAATGGATAT GGTTGATGAC 100 

GAAGAATTGC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTGTCTGA 150 

ATATGACTTC CCAGGCGACG ATGTTCCTGT AATCGCCGGT TCTGCTTTGA 200 

AAGCTCTTGA AGGAGATCCT TCATACGAAG AAAAAATCAT GGAATTGATG 2 50 

GCTGCAGTTG ACGAATACGT TCCAACTCCA GAACGTGATA CTGACAAACC 3 00 

ATTCATGATG CCAGTCGAAG ACGTATTCTC AATCACTGGA CGTGGTACTG 3 50 

TTGCTACAGG CCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGATGAAGTA 4 00 

GAAATCGTTG GTATTGCTGA CGAAACTGCT AAAACAACTG TAACAGGTGT 4 50 

TGAAATGTTC CGTAAATTGT TAGACTATGC TGAAGCAGGG GATAACATTG 500 

GTGCATTGCT ACGTGGGGTT GCTCGTGAAG ACATCCAACG TGGACAAGTA 550 

TTGGCTAAAG CTGGTACAAT CACACCTCAT ACAAAATTCA AAGCTGAAGT 600 

TTATGTTTTG ACAAAAGAAG AAGGTGGACG TCACACTCCA 64 0 



2) INFORMATION FOR SEQ ID NO: 880 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: PAO-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880 

CGGCGCGATC CTGGTTTGCT CGGCTGCCGA CGGCCCCATG CCGCAGACCC 
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GCGAGCACAT 
TTCCTGAACA 
CGAGATGGAA 
ACACTCCGAT 
GACAACGGCA 
CTCCTACATT 
CGATCGAAGA 
CGTGTAGAGC 
CATCAAGGCG 
AGCTGCTCGA 
GGCACCAAGC 
CACCATCAAG 
AGGAAGAAGG 
TTCTACTTCC 
CGTAGAGATG 
TCGCTCCGAT 



CCTGCTGTCC 
AGGCCGACAT 
GTTCGCGATC 
CATCATCGGT 
TCGGCGTAAG 
CCGGAGCCGG 
CGTGTTCTCG 
GCGGCATCAT 
ACCACCAAGA 
CGAAGGTCGT 
GTGAAGACGT 
CCGCACACCA 
TGGTCGTCAC 
GTAC CACCGA 
GTAATGC CGG 
CGCCATGGAA 



CGCCAGGTAG 
GGTCGACGAC 
TGCTGAACAC 
TCCGCGCTGA 
CGCCGTGCAG 
TTCGTGCCAT 
ATCTCCGGCC 
CAAGGTCCAG 
CTACCTGCAC 
GCTGGTGAGA 
AGAGCGTGGC 
AGTTCGAGTG 
ACCCCGTTCT 
YGTGACCGGT 
GCGACAACAT 
GATGGCCTGC 



GCGTTCCCTA CATCGTC&TG 
GCCGAGCTGC TGGAACTGGT 
CTACGACTTC CCGGGCGACG 
TGGCGCTGGA AGGCAAGGAT 
AAGCTGGTAG AGACCCTGGA 
CGACCAGCCG TTCCTGATGC 
GCGGTACCGT GGTAACCGGT 
GAAGAAGTGG AAATCGTCGG 
CGGCGTTGAA ATGTTCCGCA 
ACGTTGGTAT CCTGCTGCGT 
CAGGTTCTGG CCAAGCCGGG 
CGAAGTGTAC GTGCTGTCCA 
TCAAGGGCTA CCGTCCGCAG 
AACTGCGAAC TGCCGGAAGG 
CAAGATGGTT GTCACCCTGA 
G 



100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
831 



2) INFORMATION FOR SEQ ID NO: 881 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseli flaws 

(B) STRAIN: R421 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 881 

CCTATGCCTC AAACACGTGA ACACATCTTG TTATCACGTA ACGTTGGTGT 50 

AC CAT AC AT C GTTGTTTTCT TAAACAAAAT GGATATGGTT GATGACGAAG 100 

AATTACTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT GTCAGAATAT 150 

GACTTCCCAG GCGACGATGT TCCTGTAATC GCTGGTTCTG CTTTGAAAGC 2 00 

TCTTGAAGGC GATGCTTCAT ACGAAGAAAA AATCATGGAA TTAATGGCTG 250 

CAGTTGACGA ATACGTTCCA ACTCCAGAAC GTGACACTGA CAAACCATTC 300 

ATGATGCCAG TCGAAGACGT ATTCTCAATC ACTGGACGTG GTACTGTTGC 3 50 

TACAGGCCGT GTTGAACGTG GACAAGTTCG CGTTGGTGAC GAAGTTGAAA 400 

TCGTTGGTAT TGCTGAAGAA ACTGCTAAAA CAACTGTAAC TGGTGTTGAA 450 

ATGTTCCGTA AATTGTTAGA CTATGCTGAA GCAGGGGATA ACATTGGTGC 500 

ATTGCTACGT GGTGTTGCTC GTGAAGACAT CCAACGTGGA CAAGTATTGG 550 

CTAAAGCTGG TACAATCACA CCTCATACAA AATTTAAAGC TGAAGTTTAC 600 

GTTTTAACAA AAGAAGAAGG TGGACGTCAC ACACCATTCT TC 642 



2) INFORMATION FOR SEQ ID NO: 882 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 63 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R775 



(xi) SEQUENCE DESCRIPTION: SEQ 



ID NO: 882 



CCTATGCCTC 
ACCATACATC 
AATTACTAGA 
GACTTCCCAG 
TCTTGAAGGC 
CAGTTGACGA 
ATGATGCCAG 
TACAGGCCGT 
TCGTTGGTAT 
ATGTTCCGTA 
ATTGCTACGT 
CTAAAGCTGG 
GTTTTAACAA 



AAACACGTGA 
GTTGTTTTCT 
ATTAGTTGAA 
GCGACGATGT 
GATGCTTCAT 
ATACGTTCCA 
TCGAAGACGT 
GTTGAACGTG 
TGCTGAAGAA 
AATTGTTAGA 
GGTGTTGCTC 
TACAATCACA 
AAGAAGAAGG 



ACACATCTTG 
TAAACAAAAT 
ATGGAAGTTC 
TCCTGTAATC 
ACGAAGAAAA 
ACTCCAGAAC 
ATTCTCAATC 
GACAAGTTCG 
ACTGCTAAAA 
CTATGCTGAA 
GTGAAGACAT 
CCTCATACAA 
TGGACGTCAT 



TTATCACGTA ACGTTGGTGT 
GGATATGGTT GATGACGAAG 
GTGACTTATT GTCAGAATAT 
GCTGGTTCTG CTTTGAAAGC 
AATCATGGAA TTAATGGCTG 
GTGACACTGA CAAACCATTC 
ACTGGACGTG GTACTGTTGC 
CGTTGGTGAC GAAGTTGAAA 
CAACTGTAAC TGGTGTTGAA 
GCAGGGGATA ACATTGGTGC 
CCAACGTGGA CAAGTATTGG 
AATTTAAAGC TGAAGTTTAC 
ACACCA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
636 



2) INFORMATION FOR SEQ ID NO: 883 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecal is 

(B) STRAIN: R422 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 883 

GGTCCTATGC CTCAAACACG TGAACATATC TTATTATCAC GTAACGTTGG 50 

TGTACCATAC ATCGTTGTAT TCTTAAACAA AATGGATATG GTTGATGACG 100 

AAGAATTATT AGAATTAGTA GAAATGGAAG TTCGTGACTT ATTATCAGAA 150 

TACGATTTCC CAGGCGATGA TGTTCCAGTT ATCGCAGGTT CTGCTTTGAA 2 00 

AGCTTTAGAA GGCGACGAGT CTTATGAAGA AAAAATCTTA GAATTAATGG 2 50 

CTGCAGTTGA CGAATATATC CCAACTCCAG AACGTGATAC TGACAAACCA 3 00 

TTCATGATGC CAGTCGAAGA CGTATTCTCA ATCACTGGAC GTGGTACTGT 3 50 

TGCTACAGGC CGTGTTGAAC GTGGTGAAGT TCGCGTTGGT GACGAAGTTG 4 00 

AAATCGTTGG TATTAAAGAC GAAACATCTA AAACAACTGT TACAGGTGTT 450 

GAAATGTTCC GTAAATTATT AGACTACGCT GAAGCAGGCG ACAACATCGG 500 

TGCTTTATTA CGTGGTGTAG CACGTGAAGA TATCGAACGT GGACAAGTAT 550 

TAGCTAAACC AGCTACAATC ACTCCACACA CAAAATTCAA AGCTGAAGTA 600 

TACGTATTAT CAAAAGAAGA AGGCGGACGT CACACTCCAT T 641 



2 ) INFORMATION FOR SEQ ID NO: 884 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecetlis 

(B) STRAIN: R575 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 884 

TATGCCTCAA ACACGTGAAC ATATCTTATT ATCACGTAAC GTTGGTGTAC 50 

CATACATCGT TGTATTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA 100 

TTATTAGAAT TAGTAGAAAT GGAAGTTCGT GACTTATTAT CAGAATACGA 150 

15 TTTCCCAGGC GATGATGTTC CAGTTATCGC AGGTTCTGCT TTGAAAGCTT 2 00 

TAGAAGGCGA CGAGTCTTAT GAAGAAAAAA TCTTAGAATT AATGGCTGCA 2 50 

GTTGACGAAT ATATCCCAAC TCCAGAACGT GATACTGACA AACCATTCAT 3 00 

GATGCCAGTC GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA 3 50 

CAGGCCGTGT TGAACGTGGT GAAGTTCGCG TTGGTGACGA AGTTGAAATC 4 00 

2 0 GTTGGTATTA AAGACGAAAC ATCTAAAACA ACTGTTACAG GTGTTGAAAT 4 50 

GTTCCGTAAA TTATTAGACT ACGCTGAAGC AGGCGACAAC ATCGGTGCTT 50 0 

TATTACGTGG TGTTGCACGT GAAGATATCG AACGTGGACA AGTATTAGCT 550 

AAACCAGCTA CAATCACTCC ACACACAAAA TTCAAAGCTG AAGTATACGT 6 00 

ATTATCAAAA GAAGAAGGCG GACGTCACAC TCCATTCTTC 64 0 

25 

2) INFORMATION FOR SEQ ID NO: 885 

30. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
40 (B) STRAIN: R492 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 885 

TGCCTCAAAC TCGTGAACAC ATCCTATTGT CTCGTCAAGT TGGTGTTCCT 50 

45 TACATCGTTG TATTCTTGAA CAAAGTAGAC ATGGTTGATG ACGAAGAATT 10 0 

ACTAGAATTA GTTGAAATGG AAGTTCGTGA CCTATTAACA GAATACGAAT 150 

TCCCTGGTGA CGATGTTCCT GTAGTTGCTG GATCAGCTTT GAAAGCTCTA 2 00 

GAAGGCGACG CTTCATACGA AGAAAAAATT CTTGAATTAA TGGCTGCAGT 2 50 

TGACGAATAC ATCCCAACTC CAGAACGTGA CAACGACAAA CCATTCATGA 3 00 

50 TGCCAGTTGA AGACGTGTTC TCAATTACTG GACGTGGTAC TGTTGCTACA 350 

GGTCGTGTTG AACGTGGACA AGTTCGCGTT GGTGACGAAG TTGAAGTTGT 4 00 

TGGTATTGCT GAAGAAACTT CAAAAACAAC AGTTACTGGT GTTGAAATGT 4 50 

TCCGTAAATT GTTAGACTAC GCTGAAGCTG GAGACAACAT TGGTGCTTTA 500 

CTACGTGGTG TTGCACGTGA AGACATCCAA CGTGGACAAG TTTTAGCTAA 550 

55 ACCAGGTACA ATCACACCTC ATACAAAATT CTCTGCAGAA GTATACGTGT 600 

TGACAAAAGA AGAAGGTGGA CGTCATACTC CA 632 



60 2) INFORMATION FOR SEQ ID NO: 886 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R576 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 886 



CGGCCCAATG CCTCAAACTC GTGAACACAT CCTATTGTCT CGTCAAGTTG 50 

GTGTTCCTTA CATCGTTGTA TTCTTGAACA AAGTAGACAT GGTTGATGAC 100 

GAAGAATTAC TAGAATTAGT TGAAATGGAA GTTCGTGACC TATTAACAGA 150 

ATACGAATTC CCTGGTGACG ATGTTCCTGT AGTTGCTGGA TCAGCTTTGA 2 00 

AAGCTCTAGA AGGCGACGCT TCATACGAAG AAAAAATTCT TGAATTAATG 250 

GCTGCAGTTG ACGAATACAT CCCAACTCCA GAACGTGACA ACGACAAACC 3 00 

ATTCATGATG CCAGTTGAAG ACGTGTTCTC AATTACTGGA CGTGGTACTG 3 50 

TTGCTACAGG TCGTGTTGAA CGTGGACAAG TTCGCGTTGG TGACGAAGTT 4 00 

GAAGTTGTTG GTATTGCTGA AGAAACTTCA AAAACAACAG TTACTGGTGT 4 50 

TGAAATGTTC CGTAAATTGT TAGACTACGC TGAAGCTGGA GACAACATTG 5 00 

GTGCTTTACT ACGTGGTGTT GCACGTGAAG ACATCCAACG TGGACAAGTT 550 

TTAGCTAAAC CAGGTACAAT CACACCTCAT ACAAAATTCT CTGCAGAAGT 600 

ATACGTGTTG ACAAAAGAAG AAGGTGGACG TCATACTCCA 64 0 



2) INFORMATION FOR SEQ ID NO: 887 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 06 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 887 

ATCCTGGTTT GCTCCGCAGC TGACGGCCCA ATGCCACAGA CCCGCGAGCA 50 

CATCCTGCTG GCCCGCCAAG TTGGCGTTCC ATACATCATC GTGTTCCTGA 100 

ACAAGTGCGA CCTGGTTGAC GACGCAGAAC TGCTGGAACT GGTCGAAATG 150 

GAAGTGCGTG AATTGCTGTC GAAATACGAG TTCCCAGGCG ACGACGTACC 2 00 

AATCATCAAG GGTTCGGCAC GTATGGCGCT GGAAGGCAAA GAAGGCGAGA 250 

TGGGCGTTGA CGCCATCATG CGTCTGGCCG ATGCACTGGA CAGCTACATC 3 00 

CCTACGCCAG AGCGCGCAGT CGATGGCGCC TTCCTGATGC CAGTGGAAGA 350 

CGTGTTCTCG ATCTCGGGTC GCGGTACCGT TGTGACCGGT CGTATCGAGC 400 

GCGGCGTGAT CAAGGTCGGC GAAGAGATCG AAATCGTCGG CATTATCGAC 450 

ACCGTCAAAA CCACTTGCAC CGGCGTGGAA ATGTTCCGCA AGCTGCTGGA 500 

CCAGGGTCAA GCCGGCGACA ACGTTGGTCT GCTGCTGCGC GGCACCAAGC 550 

GTGAAGACGT ACAGCGTGGT CAGGTTCTGG CCAAGCCAGC GTCGATCAAG 600 

CCGCACAACC ACTTCACCGG CGAGATCTAC GTTCTGTCGA AAGATGAAGG 650 
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CGGCCGTCAC ACCCCGTTCT 
GTACGACTGA CGTGACCGGT 
GTCATGCCAG GCGACAACGT 
CGCGAT 



TCAACAACTA TCGTCCACAG 
TCGATCGAAC TGCCAGCAGA 
GTCGATCACC GTCAAGCTGA 
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TTCTACTTCC " ^70CT 
CAAAGAAATG 750 
TCAACCCGAT 800 

806 



2 ) INFORMATION FOR SEQ ID NO: 888 



10 



15 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecal is 

(B) STRAIN: R503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 888 



25 



30 



35 



TATGCCTCAA 
CATACATCGT 
TTATTAGAAT 
TTTCCCAGGC 
TAGAAGGCGA 
GTTGACGAAT 
GATGCCAGTC 
CAGGCCGTGT 
GTTGGTATTA 
GTTCCGTAAA 
TATTACGTGG 
AAACCAGCTA 
ATTATCAAAA 



ACACGTGAAC 
TGTATTCTTA 
TAGTAGAAAT 
GATGATGTTC 
CGAGTCTTAT 
ATATCCCAAC 
GAAGACGTAT 
TGAACGTGGT 
AAGACGAAAC 
TTATTAGACT 
TGTAGCACGT 
CAATCACTCC 
GAAGAAGGCG 



ATATCTTATT 
AACAAAATGG 
GGAAGTTCGT 
CAGTTATCGC 
GAAGAAAAAA 
TCCAGAACGT 
TCTCAATCAC 
GAAGTTCGCG 
ATCTAAAACA 
ACGCTGAAGC 
GAAGATATCG 
ACACACAAAA 
GACGTCACAC 



ATCACGTAAC 
ATATGGTTGA 
GACTTATTAT 
AGGTTCTGCT 
TCTTAGAATT 
GATACTGACA 
TGGACGTGGT 
TTGGTGACGA 
ACTGTTACAG 
AGGCGACAAC 
AACGTGGACA 
TTCAAAGCTG 
TCCA 



GTTGGTGTAC 50 

TGACGAAGAA 10 0 

CAGAATACGA 15 0 

TTGAAAGCTT 2 00 

AATGGCTGCA 2 50 

AACCATTCAT 3 00 

ACTGTTGCTA 3 50 

AGTTGAAATC 4 00 

GTGTTGAAAT 4 50 

ATCGGTGCTT 500 

AGTATTAGCT 550 

AAGTATACGT 600 

634 



40 2 ) INFORMATION FOR SEQ ID NO: 889 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 93 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 14110 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 889 



TGTCTTCATC CAGGAATTGA TTGTGAGTCG TTCCACATGC TCACCTAGTT 50 

TTCGCTCGAT CTTTTCACTA ACGCAAACCA TGTAGAACAA CATTGCCAAG 100 

GCCCACGGTG GTTACTCCGT CTTCACTGGT GTTGGTGAGC GTACTCGTGA 150 

GGGTAACGAT CTGTACCACG AAATGCAGGA GACTGGTGTC ATTCAGCTCG 2 00 

60 AGGGTGAATC CAAGGTCGCA CTGGTGTTCG GACAGATGAA CGAGCCCCCC 2 50 

499 
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GGTGCCCGTG CCCGTGTCGC CCTTACCGGT 
CCGTGACGAG GAGGGTCAGG ACGTGCTGCT 
GTTTCACCCA GGCCGGTTCT GAGGTGTCTG 
TCTGCCGTCG GTTACCAGCC CACCCTGGCC 
5 GGAGCGTATC ACCACCACCA AGAAGGGTTC 



PCT/CA00/01150 

CTGACCATTG CCGAGTACTT 3 00 

CTTCATTGAC AACATTTTCC 3 50 

CCCTTCTCGG TCGTATCCCC 4 00 

GTCGACATGG GTGGTATGCA 450 

TATTACCTCC GTC 4 93 



10 



15 



20 



2) INFORMATION FOR SEQ ID NO: 890 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 66 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 890 



25 



30 



35 



TTGATTGTAC 
AAGCGTAGAA 
GGTGTCGGTG 
GGAAACTGGT 
TCGGTCAGAT 
GGTTTGACCA 
TCTCTTCATT 
CTGCCCTTCT 
GCCGTCGACA 
TTCCATCACC 



GTCTTTACCT 
CAACATTGCC 
AACGTACTCG 
GTCATTCAGC 
GAACGAGCCC 
TTGCCGAGTA 
GACAACATTT 
GGGTCGTATC 
TGGGTATCAT 
TCCGTC 



TTCTGCCTGA 
AAGGCTCACG 
TGAGGGTAAC 
TCGAGGGTGA 
CCCGGTGCCC 
CTTCCGTGAC 
TCCGTTTCAC 
CCCTCTGCCG 
GCAGGAGCGT 



CTGTTTACGA 
GTGGTTACTC 
GATTTGTACC 
ATCCAAGGTC 
GTGCC CGTGT 
GAGGAAGGTC 
TCAGGCCGGT 
TCGGTTACCA 
ATTACCACCA 



CAACTAACGA 50 

TGTCTTCACT 100 

ACGAAATGCA 150 

GCCCTCGTGT 2 00 

CGCTCTTACT 2 50 

AGGACGTGCT 3 00 

TCTGAGGTGT 3 50 

GCCCACCCTT 4 00 

CCACCAAGGG 4 50 

466 



2) INFORMATION FOR SEQ ID NO: 8 91 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 78 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 
50 (B) STRAIN: ATCC 42 570 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 91 

AGGAGCTGAT CAACAACATC GCCAAGGCTC ACGGTGGTTA CTCCGTCTTC 50 

55 ACTGGTGTCG GTGAGCGTAC CCGTGAGGGT AACGATCTGT ACCACGAAAT 100 

GCAGGAGACC TCGGTCATTC AGCTCGAGGG CGAGTCTAAG GTGGCCCTGG 150 

TCTTTGGTCA GATGAACGAG CCCCCGGGTG CTCGTGCCCG TGTCGCTCTT 2 00 

ACTGGTCTTA CCGTCGCCGA GTACTTCCGT GACCAGGAGG GTCAGGATGG 250 

TTAGTTCTCG TCCACTCATG CCGAAACATG TGCGTGTTCC GAGGCTAATC 3 00 

60 AACGTGCCAG TGCTGCTTTT CATCGACAAC ATTTTCCGAT TCACACAGGC 350 

500 
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CGGTTCCGAG GTGTCTGCCC TGCTGGGTCC?"' #A/f ^tfe'TCT GCCGTtGGTT 4 0 ^/ 

ACCAGCCCAC CCTCGCCGTC GACATGGGTG GCATGCAGGA GCGTATCACC 4 50 

ACCACCAAGA AGGGCTCTAT CACCTCCG 4 78 



2) INFORMATION FOR SEQ ID NO: 892 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 481 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 892 



GTCTTTATCC 
TACGACAACT 

2 5 TACTCTGTCT 
GTACCACGAA 
AGGTCGCCCT 
CGTGTCGCTC 
AGGTCAGGAC 

30 CCGGTTCTGA 
TACCAGCCCA 
CACCACCACC 



AGGAGTTGAT 
AACGAAAGCG 
TCACTGGTGT 
ATGCAGGAAA 
CGTGTTCGGT 
TTACTGGTTT 
GTGCTTCTCT 
GGTGTCTGCC 
CCCTTGCCGT 
AAGGGTTCCA 



TGTACGTCTT 
TAGAACAACA 
CGGTGAACGT 
CTGGTGTCAT 
CAGATGAACG 
GACCATTGCC 
TCATTGACAA 
CTTCTGGGTC 
CGACATGGGT 
TCACCTCCGT 



TACCTTTCTG 
TTGCCAAGGC 
ACTCGTGAGG 
TCAGCTCGAG 
AGCCCCCCGG 
GAGTACTTCC 
CATTTTCCGT 
GTATCCCCTC 
ATCATGCAGG 
C 



CCTGACTGTT 50 

TCACGGTGGT 10 0 

GTAACGATTT 15 0 

GGTGAATCCA 200 

TGCCCGTGCC 2 50 

GTGACGAGGA 300 

TTCACTCAGG 350 

TGCCGTCGGT 400 

AGCGTATTAC 450 

481 



35 



40 



2) INFORMATION FOR SEQ ID NO: 893 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 0 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 142 85 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 893 



CTCGTCCGTG 
CGTCGGCCCC 
TCGACGAGCG 

55 GAGGCTCCCG 
TGGTATCAAG 
TTGGTCTGTT 
CTCATCAACA 
TGTCGGCGAG 

60 AGACCTCTGT 



GTGCTAAGGC 
GGTACCCTCG 
CGGTCCCATC 
AGTTCGTTGA 
GTCGTCGATC 
TGGCGGTGCC 
ACATCGCCAA 
CGTACCCGTG 
CATTCAGCTT 



CACTGACACT GGTTSCCCCA 
GTCGCATCAT GAACGTCACC 
AAGACCGACA AGTTCCGTCC 
CCAGTCGACC ACCGCTGAGG 
TGCTTGCTCC CTACGCCCGT 
GGTGTTGGCA AGACCGTGTT 
GGCCCACGGT GGTTACTCCG 
AGGGTAACGA TCTGTACCAC 
GACGGTGACT CCAAGGTCGC 
501 



TTACCATCCS 50 

GGTGACCCGA 100 

CATCCACGCT 150 

TTCTCGTGAC 200 

GGTGGTAAGA 250 

CATCCAGGAG 300 

TCTTCACCGG 3 50 

GAAATGCAGG 4 00 

CCTGGTGTTC 450 
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GGTCAGATGA ACGAGCCCCC TGGTGCTCGT 
TTTGACTGTC GCTGAGTACT TCCGTGACGA 
TTTGAATTAT TTCCTTGTCG TACAGTTCCA 
GTCAGTGCTT CTCTTCATCG ACAACATTTT 
5 CTGAGGTGTC TGCCCTTCTG GGTCGTATTC 
CCCACGCTCG CCGTGGACAT GGGTCTGATG 
CCGCAAGGGC TCAATTACCT CCGTCCAGGC 
ATCTGACGGA TCCCGCCCCC GCCACCACCT 
ACTGTGCTGT CCCGAGGTAT CTCTGAGCTG 

10 CCCCCTCGAC TCCAAGTCGC GTATGCTGGA 
ACCACTACGA GACCGCCACT CGCGTCCAGC 
TCGCTGCAGG ACATCATCGC CATTCTGGGT 
CGACAAGCTT ACAGTCGAGC GTGCTCGTAA 
AGCCGTTCAC GGTCGCGCAG GTCTTCACTG 

15 GATCTGAAGG ACACTATCGC TTCGTTCAAG 
TGACAGCC 



PCT/CA00/01150 

GCCCGTGTCG CCTTGACC?GG 500 

GGAGGGCCAG GATGGTATGT 550 

AATCGAAGAA TTACTAACTT 600 

CCGCTTCACC CAGGCCGGTT 650 

CCTCCGCTGT CGGTTACCAG 70 0 

CAGGAGCGTA TTACCACCAC 75 0 

CGTCTACGTG CCCGCTGACG 800 

TCGCCCATCT GGACGCCACC 850 

GGTATCTACC CCGCTGTCGA 900 

CCCCCGTATT GTCGGTGACG 950 

AGATCCTCCA AGAGTACAAG 100 0 

ATGGACGAGC TGTCTGAGGC 1050 

GATCCAGCGT TTCCTGAGCC 1100 

GTATCGAAGG CCAGCTGGTC 115 0 

GCTATCCTGA GCGGTGAGGG 12 00 

1208 



20 2) INFORMATION FOR SEQ ID NO: 894 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Malbranchea fileunentosa 

(B) STRAIN: ATCC 48174 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8 94 



TTCAGGAACT TATTGTAAGC CGCCCTCTTT ATGCATTGAG GGTGAATAAG 5 0 

AAGGCTGACA GGTAATAGAA CAACATTGCC AAGGCTCACG GTGGTTACTC 10 0 

CGTGTTCACT GGTGTCGGTG AGCGTACCCG TGAAGGAAAC GATCTGTACC 15 0 

ATGAAATGCA GGAAACCCGC GTCATCCAGC TTGATGGCGA GTCTAAGGTC 200 

4 0 GCCCTTGTGT TCGGTCAGAT GAACGAGCCC CCTGGAGCCC GTGCCCGTGT 25 0 

CGCTCTTACT GGTCTTACCG TTGCCGAATA CTTCCGTGAC GAGGAGGGCC 300 

AAGATGGTAC GCCTTTTTAC TCTTCTTATT CTTCGGGTCG GACTACAGAA 35 0 

CTAACCTGCT CCAGTGCTTC TCTTCATTGA TAACATTTTC CGTTTCACAC 40 0 

AAGCCGGTTC TGAGGTGTCT GCCTTGCTTG GACGTATTCC CTCTGCCGTC 450 

45 GGTTACCAGC CCACTCTCGC CGTCGACATG GGTGGTATGC AGGAACGTAT 50 0 

CACAACCACC AACAAGGGTT CCATTACTTC CGTG 534 



50 2) INFORMATION FOR SEQ ID NO: 895 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 8 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 
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PCT/CA00/01150 



(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 60735 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 895 

5 

CAAGGCTCAC GGTGGTTACT CCGTCTTCAC TGGTGTCGGT GAGCGTACCC 50 

GTGAGGGTAA CGATCTGTAC CACGAAATGC AGGAGACCTC GGTCATTCAG 100 

CTCGAGGGCG AGTCTAAGGT GGCCCTGGTC TTTGGTCAGA TGAACGAGCC 150 

CCCGGGTGCT CGTGCCCGTG TCGCTCTTAC TGGTCTTACC GTCGCCGAGT 2 00 

10 ACTTCCGTGA CCAGGAGGGT CAGGATGGTT AGTTCTCGTC CACTCATGCC 250 

GAAACATGTG CGTGTTCCGA GGCTAATCAA CGTGCCAGTG CTGCTTTTCA 3 00 

TCGACAACAT TTTCCGATTC ACCCAGGCCG GTTCCGAGGT GTCTGCCCTG 3 50 

CTGGGTCGTA TCCCCTCTGC CGTCGGTTAC CAGCCCACCC TCGCCGTCGA 400 

CATGGGTGGC ATGCAGGAGC GTATCACCAC CACCAAGAAG GGCTCTAT 44 8 

15 



2) INFORMATION FOR SEQ ID NO: 896 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 83 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



45 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nlger 
30 (B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 896 

ATTCAGGAGC TGATTGTAAG TTGCCAATCC ATGAACTGGA GATTTGGTGT 50 

35 GACCCATAGA ACTAACAAAT TATTTAGAAC AACATCGCCA AGGCTCACGG 100 

TGGTTACTCC GTCTTCTGTG GTGTCGGTGA GCGTACTCGT GAGGGTAACG 150 

ATCTGTACCA CGAAATGCAG GAGACTGGTG TCATCCAGCT CGAGGGTGAC 2 00 

TCCAAGGTCG CTCTGGTCTT CGGTCAGATG AACGAGCCCC CGGGTGCCCG 250 

TGCCCGTGTC GCCCTTACCG GTCTGACCAT TGCCGAGTAC TTCCGTGACG 3 00 

40 AGGAGGGTCA GGACGTGCTG CTCTTCATTG ACAACATTTT CCGTTTCACC 3 50 

CAGGCCGGTT CTGAGGTGTC TGCCCTTCTG GGTCGTATCC CCTCTGCCGT 4 00 

CGGTTACCAG CCCACTCTGG CCGTCGACAT GGGTGGTATG CAGGAGCGTA 450 

TTACCACCAC CACCAAGGGT TCCATTACCT CCG 4 83 



2) INFORMATION FOR SEQ ID NO: 897 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1124 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : L i near 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 14110 
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WO 01/23604 PCT/CA00/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 897 

AAGCTCAAGT CCGAGCGTGA GCGTGGTATC ACCATCGACA TTGCCCTCTG 50 

GAAGTTCCAG ACTCCCAAGT ATGAGGTCAC TGTCATCGGT AAGCTCGACT 100 

5 CGCCCCGATA TGTTTTGGTG CTGTAGCTAA CACGATCTGA AGATGCCCCC 150 

GGTCACCGTG ACTTCATCAA GAACATGATC ACTGGTACCT CCCAGGCTGA 2 00 

CTGCGCTATC CTCATCATTG CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA 2 50 

TCTCCAAGGA TGGCCAGACC CGTGAGCACG CTCTGCTGGC TTTCACCCTC 3 00 

GGTGTCAAGC AGCTCATCGT CGCCCTCAAC AAGATGGACA CCTGCAAGTG 3 50 

10 GTCCGAGGAT CGTTACAACG AAATTGTCAA GGAAACCTCC AACTTCATCA 400 

AGAAGGTCGG CTACAACCCC AAGGCCGTTC CCTTCGTCCC CATCTCTGGC 450 

TTCAACGGTG ACAACATGCT TGAGCCCTCC TCCAACTGCC CCTGGTACAA 50 0 

GGGATGGGAG AAGGAGACCA AGGCCGGCAA GGTCACTGGT AAGACCCTCA 550 

TCGAGGCCAT CGACGCCATT GAGCCCCCTG TCCGTCCCTY CAACAAGCCC 600 

15 CTCCGTCTTC CCCTCCAGGA TGTCTACAAG ATCTCTGGTA TCGGAACGGT 650 

CCCTGTCGGC CGTGTCGAGA CCGGTATCAT CAAGCCCGGC ATGGTCGTCA 700 

CCTTCGCCCC CGCCAACGTC ACCACTGAAG TCAAGTCCGT CGAGATGCAC 750 

CACCAGCAGC TCCAGGAGGG TGTCCCCGGT GACAACGTCG GTTTCAACGT 800 

CAAGAACGTT TCCGTCAAGG AAGTCCGCCG TGGTAACGTC TGCGGTGACT 850 

2 0 CCAAGAACGA TCCCCCTCAG GGTGCTGCCT CCTTCAACGC CCAGGTCATC 900 

GTCCTCAACC ACCCCGGTCA GGTCGGCGCT GGTTACGCCC CCGTCCTCGA 950 

CTGCCACACT GCCCACATTG CTTGCAAGTT CTCTGAGCTG CTTGAGAAGA 100 0 

TTGACCGCCG TACCGGCAAG TCTGTTGAGA ACAACCCCAA GTTCATCAAG 105 0 

TCCGGTGATG CCGCCATCGT GAAGATGGTT CCTTCCAAGC CCATGTGTGT 110 0 

25 CGAGTCCTTC ACTGACTACC CCCC 1124 
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2 ) INFORMATION FOR SEQ ID NO: 898 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 63 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Penicillium marneffei 

(D) STRAIN: WSA-214 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 898 

45 CTCAAGGCTG AGCGTGAGCG TGGTATCACC ATCGATATTG CTCTCTGGAA 50 

GTTCCAGACT GCCAAGTACG AGGTTACCGT CATTGACGCC CCCGGTCACC 100 

GTGATTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC CGATTGCGCT 150 

ATTCTCATCA TTGCCTCTGG TACTGGTGAA TTCGAGGCTG GTATCTCCAA 2 00 

GGATGGCCAG ACTCGTGAGC ACGCTCTTTT GGCTTTCACC CTCGGTGTCC 250 

50 GTCAGCTCAT TGTTGCCCTC AACAAGATGG ACACTTGCAA GTGGTCTCAG 3 00 

GGTGAGTACT CGTACCTGCG TTTGGCCTTG AATATCTTAC TAATGCACCA 350 

TAGATCGTTA CAACGAAATT GTCAAGGAGA CTTCCAACTT CATCAAGAAG 4 00 

GTCGGATACA ACCCTAAGAA CGTTCCTTTC GTTCCTATCT CCGGTTTCAA 4 50 

CGGTGACAAC ATGCTTGAGC CCTCCCCCAA CTGCCCCTGG TACAAGGGTT 500 

55 GGGAGAAGGA GACCAAGGCC GGTAAGGTCA CTGGTAAGAC CCTCCTCGAG 550 

GCCATCGACG CCATTGAGCC CCCTACCCGT CCCGCCAACA AGGTTAGTCC 600 

CTCCTCGACT ACTCAAACCC TCCTCATAAG TTCATGATTA CGACTCGTTC 650 

ACAGCCCCTC CGTCTTCCCC TCCAGGATGT CTACAAGATC GGTGGTATTG 700 

GAACGGTTCC CGTCGGTCGT GTTGAGACTG GTACCATCGT TCCTGGTATG 750 

60 GTTGTCACCT TGTAAGTCAC TCTCCTCGCT TATCCTACCT GAAATCATCA 800 

504 
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TGTGCTAACT TGACACTCAG CGCTCCCGCC 
GAGTGTTGAA ATGCACCACC AGCAGCTCAC 
ACGTTGGTTT CAACGTGAAG AACGTCTCCG 
AACGTTGCTG GTGACAGCAA GAACGACCCC 
5 CAACGCCCAG GTCATCGTCC TCAACCACCC 
ACGCCCCAGT CCTCGATTGC CACACTGCCC 
GAGCTCCTCG AGAAGATTGA CCGTCGTACC 
CCCCAAGTTC ATCAAGTCCG GTGACGCTGC 
CCAAGCCCAT GTGTGTTGAG GCTTTCACCG 
10 TTCGCCGTTC GCGAGTAAGT TTTATCTCCG 
CCTTCTCCTC CGTCTTCCAT ATATATTTTT 
ACAAATCACG GGA 



PCT/CA00/O115O 

AACGTCACCA CTGAAGTCAA 850 

TGCCGGTCAG CCCGGTGACA 900 

TCAAGGAAAT CCGTCGTGGT 950 

CCTGCCGGTG CTGCCTCCTT 1000 

CGGTCAGGTC GGTGCTGGTT 1050 

ACATTGCTTG CAAGTTCGCT 1100 

GGAAAGTCTG TTGAGGACCA 1150 

CATCGTCAAG ATGATTCCTT 1200 

AGTACCCTCC TCTCGGTCGT 12 50 

TTGTCTATTT TCCATCCTTC 1300 

TCAGTTATAT GTGACTAACC 1350 

1363 
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2) INFORMATION FOR SEQ ID NO: 899 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 114 7 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 
<B) STRAIN: ATCC 24292 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 899 

CAAGCTGAAA GCCGAGCGTG AGCGTGGTAT CACTATCGAC ATTGCCCTCT 50 

GGAAGTTCGA GACTCCCAAG TACTATGTCA CCGTCATTGG TACGTCGCAT 100 

TATCTCACTC CTCACAGAAG CACGCTCCTA ACATCACACA GACGCTCCCG 150 

3 5 GTCACCGTGA TTTCATCAAG AACATGATCA CTGGTACCTC CCAGGCCGAC 200 

TGCGCTATCC TCATTATCGC TGCCGGTACT GGTGAGTTCG AGGCTGGTAT 250 

CTCCAAGGAT GGCCAGACTC GTGAGCACGC CCTGCTCGCC TACACCCTCG 300 

GTGTGCGTCA GCTCATCGTC GCCATCAACA AGATGGACAC CACCAAGTGG 3 50 

TCTGAGGCCC GTTACCAGGA GATCATCAAG GAGACCTCCA ACTTCATCAA 400 

40 GAAGGTCGGC TACAACCCCA AGACCGTCGC TTTCGTCCCC ATCTCTGGCT 450 

TCAACGGCGA CAACATGCTT GCCCCCTCCA CCAACTGCCC CTGGTACAAG 500 

GGATGGGAGC GTGAGGTCAA GGGCAACAAG CAGACCGGCA AGACCCTCCT 550 

CGAGGCCATT GACGGCATTG AGCCCCCCAA GCGTCCTTCC GACAAGCCCC 600 

TCCGTCTTCC TCTCCAGGAT GTTTACAAGA TCGGTGGTAT CGGAACTGTT 650 

45 CCTGTCGGCC GTATCGAGAC TGGTGTCCTC AAGCCCGGTA TGGTCGTTAC 700 

CTTCGCTCCC GCCAACGTCA CCACTGAAGT CAAGTCCGTC GAGATGCACC 750 

ACGAGCAGCT CACTGAGGGT CTTCCCGGTG ACAACGTCGG TTTCAACGTG 800 

AAGAACGTTT CCGTCAAGGA CATCCGCCGT GGTAACGTTG CCAGTGACTC 850 

CAAGAACGAC CCCGCTCTGG GTGCCGCTTC TTTCGACGCC CAGGTCATCG 900 

50 TCCTCAACCA CCCCGGTCAG GTCGGTGCTG GTTACGCCCC GGTCCTCGAT 950 

TGCCACACTG CCCACATTGC TTGCAAGTTC TCCGAGATCA AGGAGAAGAT 1000 

CGACCGCCGT ACCGGCAAGT CTGTTGAGGA CGCCCCCAAG TTCATCAAGT 1050 

CTGGTGACTC TGCCATCGTC AAGATGGTTC CCTCCAAGCC CATGTGCGTT 1100 

GAGGCTTTCA CCGACTACCC TCCTCTGGGC CGTTTCGCCG TCCGTGA 114 7 

55 

2) INFORMATION FOR SEQ ID NO: 900 

60 (i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paecilomyces lilacinus 

(B) STRAIN: ATCC 60735 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 900 

CTCAAGGCCG AGCGTGAGCG TGGTATCACC ATCGACATTG CCCTCTGGAA 50 

GTTCGAGACT CCCAAGTACT ATGTCACCGT CATTGGTACG TCGACTCGCG 100 

CGAGACTGGT CGCAATTTCC ACGTCGCTAA CGTGCTTGAA CAGACGCTCC 150 

CGGCCACCGT GACTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCTG 2 00 

ACTGCGCTAT CCTCATTATC GCTGCCGGCA CTGGTGAGTT CGAGGCTGGT 250 

ATCTCCAAGG ATGGCCAGAC CCGTGAGCAC GCTCTGCTCG CCTACACCCT 3 00 

CGGTGTTAAG CAGCTCATCG TCGCTATCAA CAAGATGGAC ACCACCAAGT 3 50 

GGTCTGAGGC CCGTTTCCAG GAGATCATCA AGGAGACCTC CAACTTCATC 4 00 

AAGAAGGTCG GCTACAACCC CAAGACCGTC GCTTTCGTCC CCATCTCTGG 4 50 

TTTCCACGGC GACAACATGC TTTCCCCCTC CACCAACTGC CCCTGGTACA 500 

AGGGCTGGGA GAAGGAGACC AAGGCTGGCA AGTCCACCGG CAAGACCCTC 550 

CTTGAGGCCA TCGACTCCAT CGAGCCCCCC AAGCGCCCCA GCGACAAGCC 600 

CCTCCGCCTT CCCCTTCAGG ATGTGTACAA GATCGGCGGT ATCGGCACAG 65 0 

TCCCTGTCGG CCGTATCGAG ACTGGTGTCA TCAAGCCCGG CATGGTCGTG 700 

ACCTTCGCTC CTTCCAACGT CACCACCGAA GTCAAGTC CG T TGAGATGC A 750 

CCACGAGCAG CTCTCCGAGG GTGTCCCCGG TGACAACGTC GGCTTCAACG 800 

TCAAGAACGT CTCCGTCAAG GAGATCCGTC GTGGCAACGT CGCCGGTGAC 850 

TCCAAGAACG ACCCCCCTCT GGGTGCCGCT TCTTTCGATG CCCAGGTCAT 900 

CGTCCTCAAC CACCCCGGCC AGGTCGGTGC TGGCTACGCC CCCGTCCTCG 950 

ACTGCCACAC CGCCCACATT GCCTGCAAGT TCGCCGAGAT CAAGGAGAAG 1000 

ATCGACCGCC GTACCGGCAA GTCTGTCGAG TCCGCCCCCA AGTTCATCAA 1050 

GTCTGGCGAC TCTGCCATCG TCAAGATGAT TCCCTCCAAG CCCATGTGCG 1100 

TTGAGGCTTT CACCGACTAC CCTCCTCTGG GCCGCTTCGC CGTCCGTGAC 1150 



2) INFORMATION FOR SEQ ID NO: 901 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 32075 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 901 

TACCACTAAG TGGTCCGAGA CCCGATTCAA TGAAATTATC AAGGAAGTCA 50 

CCAATTTCAT TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT 100 

CCCATTTCTG GTTTCGAGGG TGACAACATG ATCGAGCCCT CTGCCAACTG 150 

CCCATGGTAC AAGGGCTGGT CCAAGGAGAC TGCTCAGGGC AAGTACTCTG 200 

GCAAGACCCT TCTTGAGGCC ATCGACGCCA TTGAGCCCCC CACCCGTCCT 250 

506 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

ACCGATAAAC CTCTCCGTCT TCCCCTCCAG GATGTCTACA AGATCTcdGG 3 00 

TATTGGCACT GTTCCTGTCG GACGTGTTGA GACTGGAGTC ATCAAGCCCG 350 

GTATGGTCGT GACCTTCGCT CCCGCCAACG TCACCACTGA AGTCAAGTCC 40 0 

GTCGAAATGC ACCACCAGCA GCTTACCGCC GGTAACCCCG GTGACAACGT 450 

5 CGGCTTCAAC GTCAAGAATG TTTCCGTCAA AGAAGTCCGC CGTGGTAACG 500 

TTGCCGGTGA CTCTAAGAAT GATCCCCCAA AGGGCTGCGA TTCCTTCAAT 550 

GCCCAGGTCA TCGTCCTCAA CCACCCTGGT CAGGTTGGCG CTGGTTATGC 600 

CCCAGTCCTC GACTGCCATA CTGCCCACAT TGCCTGCAAA TTCGCTGAGC 650 

TCCTTGAGAA GATTGATCGA CGAACCGGAA AGTCTGTTGA GAACAACCCC 700 

10 AAGTTCATCA AGTCCGGTGA TGCTGCTATC GTCAAGATGA TTCCTTCCAA 750 
G 



15 2) INFORMATION FOR SEQ ID NO: 902 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 

(B) STRAIN: ATCC 7968 
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55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 902 



2) INFORMATION FOR SEQ ID NO: 903 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1366 bases 
60 (B) TYPE: Nucleic acid 

507 



751 



CGTGAGCGCG GTATC AC CAT CGATATTGCT CTGTGGAAGT TCGAGACCCC 50 

CAAGTACTAC GTCACCGTCA TTGACGCCCC CGGTCATCGC GATTTCATCA 100 

AGAACATGAT CACTGGTACC TCGCAGGCCG ACTGCGCCAT TCTCATCATT 150 

GCCGCTGGTA CTGGTGAGTT CGAGGCTGGT ATCTCCAAGG ATGGCCAGAC 2 00 

35 TCGTGAGCAC GCTCTGCTCG CCTACACCCT GGGTGTGCGG CAGCTGATCG 2 50 

TCGCCATCAA CAAGATGGAC ACGGCCAAGT GGGCTGAGGC TCGTTACCAG 30 0 

GAGATCATCA AGGAGACCTC CAACTTCATC AAGAAGGTCG GCTACAACCC 3 50 

CAAGACTGTT GCCTTCGTCC CCATCTCGGG CTTCCACGGC GACAACATGC 400 

TTACTCCCTC GACCAACTGC CCCTGGTACA AGGGCTGGGA GAAGGAGGGC 4 50 

4 0 AAGAGCGGCA AGGTTACCGG TAAGACTCTG CTGGACGCCA TTGACGCCGT 500 

CGAGCCCCCC AAGCGCCCCA CGGACAAGCC CCTGCGTCTG CCCCTCCAGG 550 

ATGTCTACAA GATCGGCGGT ATCGGCACTG TCCCTGTCGG CCGTATCGAG 600 

ACTGGTGTCC TGAAGCCCGG CATGGTCGTC ACCTTTGCCC CGTCCAACGT 650 

CACCACTGAA GTCAAGTCCG TCGAGATGCA CCACGAGCAG CTTGTTGAGG 700 

45 GTGTTCCCGG CGACAACGTC GGCTTCAACG TCAAGAACGT CTCCGTCAAG 750 

GAGATCCGTC GTGGCAACGT TGCCGGTGAC TCCAAGAACG ACCCCCCCTC 800 

GGGCGCCGCC ACCTTCAACG CCCAGGTCAT TGTCCTGAAC CACCCCGGCC 850 

AGGTCGGCAA CGGCTACGCC CCGGTTCTGG ACTGCCACAC CGCCCACATT 90 0 

GCCTGCAAGT TCACCGAGAT CCTTGAGAAG ATCGACCGCC GTACCGGCAA 950 

50 GTCGGTTGAG AACAACCCCA AGTTCATCAA GTCGGGTGAC GCCGCCATTG 100 0 

TCAAGCTGAC GCCCTCGAAG CCCATGTGCG TTGAGGCCTT CACTGACTAC 1050 
CCCCCT 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 

(B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 903 

CAAGGCTGAG CGTGAGCGTG GTATCACCAT CGATATTGCT CTCTGGAAGT 50 

TCCAGACTGC CAAGTACGAG GTTACCGTCA TTGACGCCCC CGGTCACCGT 100 

GATTTCATCA AGAACATGAT CACTGGTACC TCCCAGGCCG ATTGCGCTAT 150 

TCTCATCATT GCCTCTGGTA CTGGTGAATT CGAGGCTGGT ATCTCCAAGG 200 

ATGGCCAGAC TCGTGAGCAC GCTCTTTTGG CTTTCACCCT CGGTGTCCGT 250 

CAGCTCATTG TTGCCCTCAA CAAGATGGAC ACTTGCAAGT GGTCTCAGGG 3 00 

TGAGTACTCG TACCTGCGTT TGGCCTTGAA TATCTTACTA ATGCACCATA 3 50 

GATCGTTACA ACGAAATTGT CAAGGAGACT TCCAACTTCA TCAAGAAGGT 4 00 

CGGATACAAC CCTAAGAACG TTCCTTTCGT TCCTATCTCC GGTTTCAACG 450 

GTGACAACAT GCTTGAGCCC TCCCCCAACT GCCCCTGGTA CAAGGGTTGG 500 

GAGAAGGAGA CCAAGGCCGG TAAGGTCACT GGTAAGACCC TCCTCGAGGC 550 

CATCGACGCC ATTGAGCCCC CTACCCGTCC CGCCAACAAG GTTAGTCCCT 60 0 

CCTCGACTAC TCAAACCCTC CTCATAAGTT CAGATTACTG ACTCGTTCAC 650 

AGCCCCTCCG TCTTCCCCTC CAGGATGTCT ACAAGATCGG TGGTATTGGA 70 0 

ACGGTTCCCG TCGGTCGTGT TGAGACTGGT ACCATCGTTC CTGGTATGGT 750 

TGTCACCTTG TAAGTCACTC TCCTCGCTTA TCCTACCTGA AATCATCATG 800 

TGCTAACTTG ACACTCAGCG CTCCCGCCAA CGTCACCACT GAAGTCAAGA 850 

GTGTTGAAAT GCACCACCAG CAGCTCACTG CCGGTCAGCC CGGTGACAAC 90 0 

GTTGGTTTCA ACGTGAAGAA CGTCTCCGTC AAGGAAATCC GTCGTGGTAA 950 

CGTTGCTGGT GACAGCAAGA ACGACCCCCC TGCCGGTGCT GCCTCCTTCA 100 0 

ACGCCCAGGT CATCGTCCTC AACCACCCCG GTCAGGTCGG TGCTGGTTAC 1050 

GCCCCAGTCC TCGATTGCCA CACTGCCCAC ATTGCTTGCA AGTTCGCTGA 110 0 

GCTCCTCGAG AAGATTGACC GTCGTACCGG AAAGTCTGTT GAGGACCACC 1150 

CCAAGTTCAT CAAGTCCGGT GACGCTGCCA TCGTCAAGAT GATTCCTTCC 1200 

AAGCCCATGT GTGTTGAGGC TTTCACCGAG TACCCTCCTC TCGGTCGTTT 12 50 

CGCCGTTCGC GAGTAAGTTT TATCTCCGTT GTCTATTTTC CATCCTTCCC 130 0 

TTCTCCTCCG TCTTCCATAT ATATTTTTTC AGTTATATGT GACTAACCAC 1350 

AAATCACGGG AATAGC 13 66 



2) INFORMATION FOR SEQ ID NO: 904 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Curvularia lunata 

(B) STRAIN: ATCC 26425 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 904 

CTATTATCGT TGTTGCCGCT TCCGACGGTC AAATGCCCCA GACTCGTGAG 50 
CATCTGCTGC TCGCCCGCCA GGTCGGTGTC CAGAAGATCG TTGTCTTCGT 100 

508 
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CAACAAGGTC GATGCTGTTG AGGACAAGGA 
TGGAGATGCG CGAATTGCTC AGCAGCTACG 
CCCATCATCA TGGGATCTGC CCTCTGCGCC 
GATTGGTGTC AACCGAATTG ATGAGCTGCT 
5 TCCCCACCCC TCAGCGTGAG AC CGACAAGC 
GATGTCTTCT CCATTGCTGG TCGTGGCACT 
GCGAGGTATC CTGAAGCGCG ATGCTGAAGT 
CCGCCCCCAT CAAGACCAAG GTTACCGATA 
TGCGAGGAGT CTCGCGCTGG TGACAACTCC 

10 CAAGCGTGAT GAAGTCCGCC GTGGTATGGT 
TCAAGGCGCA CAAGAAGTTC CTTGTCTCCA 
GAAGGTGGTC GTCACACTGG CTTCGGTGAG 
CATCCGCACT GCCGACGAGT CGTGTGCCCT 
AGGACGCCCA TGACAAGCTT GTTATGCCCG 

15 TGCGAGCTCC ATGCACCACA CGTCTTGGAG 



PCT/CA00/01150 

GATGTTGGAG CTCGTCGAGA 150 

GCTTCGAGGG CGACGAGACT 2 00 

ATTGAGGGCC GCGAACCTGA 250 

CGAGGCCGTT GATACTTGGA 3 00 

CTTTCCTCAT GGCCGTCGAG 3 50 

GTCGTCTCTG GCCGTGTCGA 4 00 

TGAGCTCGTC GGCAAGGGCA 450 

TCGAGACCTT CAAGAAGTCC 500 

GGTCTCCTTC TTCGTGGTGT 550 

CGTTTCCGTC CCTGGACAGG 600 

TGTACGTGTT GAGCAAGGAG 6 50 

AACTACAGGC CGCAAATGTT 700 

GTACTGGCCA GAAGGCACCG 750 

GTGACAACGT CGAGATGGTT 8 00 

CCTGGTCAAC G 841 



2) INFORMATION FOR SEQ ID NO: 905 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 967 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

3 0 (A) ORGANISM: Aspergillus niger 

(B) STRAIN: ATCC 9508 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 905 

35 CGGTGCTATC ATTGTCGTCG CCGCCTCCGA CGGTCAGATG TACGTTAACC 50 

TTAAAAGAAT AACTCTCCTT CAGTATATAT GCTTACACTG GCGATCAACA 100 

GGCCCCAGAC TCGTGAGCAC TTGCTGCTTG CTCGTCAGGT CGGTGTCCAG 150 

AAGATCGTTG TCTTCGTCAA CAAGGTCGAT GCTATCGATG ACCCCGAGAT 2 00 

GCTGGAGCTC GTTGAGCTGG AAATGCGCGA GCTTCTCAGC ACCTACGGAT 2 50 

4 0 TCGAGGGTGA GGAGACCCCC ATCGTCTTCG GCTCTGCTCT CTGCGCCATT 3 00 

GAGGACCGCC GCCCCGACAT CGGTACCGAG CGTATCGACG CTCTCCTCGA 3 50 

GGCCGTTGAC ACCTGGATCC CCACTCCCCA GCGTGACCTT GACAAGCCTT 4 00 

TCTTGATGTC CATTGAGGAA GTTTTCTCCA TCCCCGGTCG TGGTACCGTC 4 50 

GCCTCCGGCC GTGTCGAGCG TGGTCTCCTG AAGCGTGATA GCGAGGTTGA 500 

4 5 GATCATCGGT ACCACCAACG AGGTCATCAA GACCAAGGTT ACCGACATTG 550 

AGACCTTCAA GAAGTCCTGC TCCGAGTCCC GCGCCGGTGA CAACTCCGGT 600 

CTCCTGCTCC GTGGTGTCCG CCGTGAGGAT CTCCGCCGTG GTATGGTCAT 650 

TGCCGCTCCT GGCAGCGCCA AGGCCAACAG CAAGTTCATG GTCTCCATGT 700 

ACGTCCTGAC CGAGGCTGAR GGTGGTCGCC GTACCGGTTT CGGTGTCCAG 750 

50 TACCGTCCCC AGCTGTTCAT CCGCACTGCC GGTAAGTAAA ATTGCATTCT 8 00 

ATTCCGCTAC TAGGGAACCA TCTCTAATTC TATTTGCTAC AGATGAGGCT 850 

GCTGAGTTCA GCTTCCCCGA CGGAGACCAG TCCCGCCGTA TCATGCCCGG 900 

TGACAACGTC GAGATGATCG TCAAGACCCA CCGCCCCGTC GCCGCCGAGG 950 

CCGGTCAGCG CTTCAAC 967 

55 

2) INFORMATION FOR SEQ ID NO: 906 

60 (i) SEQUENCE CHARACTERISTICS: 

509 



SUBSTITUTE SHEET (RULE 26) 



10 



15 



20 



25 



30 



WO 01/23604 

(A) 
(B) 
(C) 
(D) 



PCT/CA00/01150 



LENGTH: 852 bases 
TYPE: Nucleic acid 
STRANDEDNESS : Double 
TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bipolaris hawaiiensis 

(B) STRAIN: ATCC 26067 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 906 



TGGTGCTATT 
GTGAGCATCT 
TTCGTTAACA 
CGAGATGGAG 
AGACCCCTAT 
CCCGACATTG 
TTGGATTCCC 
TCGAGGATGT 
GTCGAACGAG 
GGGCAGCGCA 
AGTCTTGCGA 
GGTGTTAAGC 
ACAGGTTAAG 
AGGAGGAAGG 
ATGTTCATCC 
CACCGAGGAT 
TGGTTTGCGA 
AA 



ATTGTTGTTG 
GCTGCTCGCC 
AGGTCGACGC 
ATGCGCGAAC 
CATCATGGGT 
GTGTCGAACG 
ACCCCTCAGC 
CTTCTCCATT 
GTATCCTGAA 
CCCATCAAGA 
GGAGTCCCGC 
GTGATGAAGT 
GCGCACAAGA 
TGGCCGACAC 
GCACTGCCGA 
GCCCACGACA 
GCTCCATGCA 



CCGCTTCCGA 
CGTCAGGTCG 
TGTCGAGGAC 
TGCTCAGCAG 
TCTGCTCTCT 
AATTGACGAG 
GTGAGACCGA 
GCTGGTCGTG 
GCGCGATGCT 
CCAAGGTTAC 
GCTGGTGACA 
CCGCCGTGGT 
AGTTCCTTGT 
ACTGGCTTCG 
CGAGTCGTGT 
AGCTTGTCAT 
CCACACGTCT 



CGGTCAAATG 
GTGTTCAGAA 
AAGGAGATGT 
CTATGGCTTC 
GCGCCATTGA 
CTGCTCGAGG 
AAAGCCTTTC 
GTACCGTCGT 
GAAGTTGAGC 
CGATATCGAG 
ACTCCGGTCT 
ATGGTCGTTT 
CTCCATGTAT 
GTGAGAACTA 
GCCCTGTACT 
GCCCGGTGAC 
TGGAGACTGG 



CCCCAGACTC 50 

GATCGTTGTC 100 

TGGAGCTTGT 150 

GAGGGCGACG 200 

AGGCCGTCAA 250 

CTGTTGATAC 3 00 

CTCATGGCCG 3 50 

CTCTGGCCGT 400 
TTGTGGGCAA " 450 

ACCTTCAAGA 500 

CCTTCTTCGT 550 

CCGTCCCTGG 600 

GTGCTGAGCA 650 

CAGGCCGCAA 700 

GGCCAGAAGG 750 

AACGTCGAGA 800 

TCAGCGCTTC 850 

852 



35 2) INFORMATION FOR SEQ ID NO: 907 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 966 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus flavus 

(B) STRAIN: ATCC 2 6947 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 907 



GGTGCTATTG TTGTCGTTGC TGCTTCGGAT GGTCATGATG TATGGACAGG 50 

CCCTTTGCTA CTGAATGGTT TCAAGATCTC GCGCTTACAC GTATTATAAT 100 

AGGCCCCAGA CCCGGGAGCA CTTGCTGCTT GCCCGTCAGG TCGGTGTCCA 150 

GAAGATCGTC GTTTTTGTCA ACAAGATTGA TGCCGTTGAG GACCCTGAGA 2 00 

55 TGTTGGAGCT TGTCGAGTTG GAAATGCGCG AGCTCCTTAG CAGCTACGGC 250 

TTCGAGGGCG AAGAGACTCC CATCATCTTC GGTTCTGCTC TGTGTGCTTT 3 00 

GGAGGACCGT CGCCCCGACA TTGGTGCCGA GCGTATCGAC GAGCTCATGA 3 50 

AGGCCGTTGA CACCTGGATC CCTACCCCTC AGCGTGATCT TGACAAGCCT 4 00 

TTCCTCATGT CTGTCGAGGA AGTCTTCTCC ATCGCCGGTC GTGGTACCGT 450 

60 TGCCTCCGGC CGTGTCGAAC GTGGTATCCT GAAGAAGGAC AGCGAAGTCG 500 

510 
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AGATCATCGG AGGTAGCTTC 
GAGACCTTCA AGAAGTCTTG 
CTTGCTTCTC CGTGGTATCC 
TTGCTGCTCC TGGCAGCACC 
5 TACGTTCTCA CTGAGGCTGA 
CTACCGCCCC CAGGTGTTCG 
TCACTTAACG GTATTGATTA 
GCTGACCTCA GCTTCCCCGA 
TGACAACGTC GAGATGGTCC 
10 CTGGCCAGCG CTTCAA 



GATGCTACCA AGACCAAGGT 
TGACGAGTCC CGCGCTGGTG 
GTCGTGAAGA CGTCCGCCGC 
AAGGCCCACG ACCAGTTCTT 
GGGTGGTCGT CGTACTGGCT 
TTCGCACTGC TGGTAAGTCA 
AGTTCTAACT GTTGTATCCT 
CGGTGATGAG TCCCGGAGGG 
TCAAGACTCA CCGCCCCATT 



PCT/CA00/01150 



CACCGACATT 55 0 

ACAACTCTGG 60 0 

GGAATGATCA 650 

GGTGTCCATG 700 

TCGGCTCCAA 750 

AGCCTTTTGC 800 

AGATGAGGCT 850 

TGATGCCTGG 90 0 

GCTGCTGAGG 950 

966 



15 



20 



25 



2) INFORMATION FOR SEQ ID NO: 908 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alternaria alternata 

(B) STRAIN: ATCC 62 0 99 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 908 



3 0 GGTGCTATCA 

TGAGCACTTG 
TCGTCAACAA 
GAGATGGAGA 
GACACCCATC 
35 CCGAGATCGG 
TGGATCCCCA 
TGAGGATGTC 
TCGAGCGCGG 
GGCACCGCGC 

4 0 GTCGTGCGAG 

GTGTCAAGCG 
CAAGTCAAGG 
AGAGGAGGGT 
TGTTCATCCG 
45 ACCGAGGATG 
GGTCTGCGAA 



TCGTCGTTGC 
CTGCTCGCCC 
GGTCGATGCT 
TGCGTGAGTT 
ATCATGGGTT 
TGTTACCAAG 
CCCCTCAGCG 
TTCTCGATTG 
TATCTTGAAG 
CAATCAAGAC 
GAGTCGCGCG 
TGATGACGTT 
CTCACAAGAA 
GGTCGTCACA 
AACTGCCGAT 
CGCACGACAA 
CTCCACCAGC 



TGCTTCCGAT 
GTCAGGTCGG 
GTCGAAGACC 
ACTCACCAGC 
CCGCTCTATG 
GTCGACGAGC 
TGAGACCGAG 
CTGGACGTGG 
CGTGACGCTG 
CAAGGTCACT 
CGGGTGATAA 
CGCCGCGGTA 
GTTCCTTGTC 
CCGGCTTCGG 
GAATCCTGCG 
GCTAGTTATG 
CCCACGTTCT 



GGTCAGATGC 
TGTTCAGAAG 
CGGAGATGTT 
TACGGCTTCG 
CGCCATCGAG 
TAATGGACGC 
AAGCCTTTCC 
TACCGTCGTT 
AAGTCGAGCT 
GATATTGAGA 
CTCCGGTCTT 
TGGTTGTTTC 
TCCATGTACG 
CGAGAACTAC 
CACTTCACTT 
CCCGGTGACA 
AGAGACCGGT 



CCCAGACCCG 50 

ATCGTTGTCT 100 

GGAACTCGTC 150 

AGGGCGACGA 200 

GGCCGCCAGC 2 50 

TGTCGACTCA 3 00 

TCATGGCTGT 3 50 

TCGGGCCGTG 400 

TGTCGGCAAG 4 50 

CCTTCAAGAA 500 

CTCCTCCGTG 550 

CGTTCCCGGA 6 00 

TTCTAAGCAA 650 

AGGCCGCAAA 700 

CCCAGAGGGT 7 50 

ACGTCGAGAT 8 00 

CAGCG 84 5 



50 2 ) INFORMATION FOR SEQ ID NO: 909 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

511 
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(A) ORGANISM: Penicillium marneffei 1 

(B) STRAIN: ATCC 64101 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 909 

5 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 10 0 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 2 00 

10 TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 250 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 3 00 

GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATACCTGGA 3 50 

TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 4 00 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 4 50 

15 GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

20 GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 7 50 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 850 

CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGAAGAC 900 

ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G 931 

25 



2) INFORMATION FOR SEQ ID NO: 910 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 931 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Penicillium marneffei 
40 (B) STRAIN: ATCC 58950 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 910 

CGCTGTTGTT GTCGTCGCTG CTTCTGATGG TCAAATGTAA CATATCCACG 50 

45 AGCTGCCAAT TATGGACACT GCTGATAAGA ATAGGCCCCA AACCCGTGAG 100 

CACTTGCTCC TCGCCCGTCA GGTCGGTGTT CAAAAGATCG TCGTCTTCGT 150 

CAACAAGGTT GATGCCGTCG AGGACCCCGA GATGTTGGAA CTTGTCGAAT 200 

TGGAAATGCG TGAACTCTTG ACCACCTACG GTTTCGAGGG TGAAGAGACC 2 50 

CCTATCATTT TCGGATCCGC TCTTTGCGCC TTGGAAGGCC GCAAGCCCGA 300 

50 GATTGGCGAA CAGAAGATTG ACGAGCTCAT GAACGCCGTT GATACCTGGA 3 50 

TCCCCACCCC CCAGCGTGAC CTTGACAAGC CCTTCTTGAT GTCCGTTGAG 400 

GAAGTTTTCT CCATCTCTGG TCGTGGTACC GTTGCATCTG GTCGTGTTGA 4 50 

GCGTGGTATT TTGCGCAAGG ATTCTGAGGT TGAGATTATC GGATACCAGA 500 

AGAACCCTAT CAAGACCAAG GTTACCGACA TTGAGACCTT CAAGAAGTCT 550 

55 TGCGATGAAT CTCGTGCTGG TGACAACTCT GGCTTGCTTC TCCGTGGTAT 600 

CAAGCGTGAG GACATTCGTC GTGGTATGGT TATCGCTGCT CCTGGAACCA 650 

CCAAGGCTCA TGACAACTTC TTGGTCTCCA TGTATGTCTT GACTGAGGCT 700 

GAAGGTGGTC GTCGTACTGG ATTCGGCGCC AACTACCGTC CTCAAGCTTT 750 

CATCCGTACT GCCGGTATGT TCCCTTTCAA AGTCAATTAA TGAGCGATTT 800 

60 GCTAACGAGT TATAGATGAG GCTGCTACTC TCAGCTTCCC CGGTGACGAT 850 

512 
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CAGTCCAAGC AGGTCATGCC CGGTGACAAC GTTGAGATGA TCTTGA^GAC 9 0 0- 

ACACCGTCCC GTTGCCGCCG AAGCTGGTCA G 931 



2) INFORMATION FOR SEQ ID NO: 911 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 911 
GACGGMKKCA TGCCGCARAC 



2) INFORMATION FOR SEQ ID NO: 912 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 912 
GACGGCGKCA TGCCGCARAC 



2 ) INFORMATION FOR SEQ ID NO: 913 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 913 
GACGGYSYCA TGCCKCAGAC 



2 ) INFORMATION FOR SEQ ID NO: 914 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

513 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 914 
GAARAGCTGC GGRCGRTAGT G 



2) INFORMATION FOR SEQ ID NO: 915 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 915 
AAACGCCTGA GGRCGGTAGT T 



2) INFORMATION FOR SEQ ID NO: 916 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 916 
GCCGAGCTGG CCGGCTTCAG 



2) INFORMATION FOR SEQ ID NO: 917 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 917 
TCGTGCTACC CGTYGCCGCC AT 



2) INFORMATION FOR SEQ ID NO: 918 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 13 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



5 



(ii) MOLECULE TYPE: Genomic DNA 



10 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(E) ACCESSION NUMBER: J01672 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 918 

AGAGAAGCCT GTCGGCACCG TCTGGTTTGC TTTTGCCACT GCCCGCGGTG 

15 AAGGCATTAC CCGGCGGGAT GCTTCAGCGG CGACCGTGAT GCGGTGCGTC 
GTCAGGCTAC TGCGTATGCA TTGCAGACCT TGTGGCAACA ATTTCTACAA 
AACACTTGAT ACTGTATGAG CATACAGTAT AATTGCTTCA ACAGAACATA 
TTGACTATCC GGTATTACCC GGCATGACAG GAGTAAAAAT GGCTATCGAC 
GAAAACAAAC AGAAAGCGTT GGCGGCAGCA CTGGGCCAGA TTGAGAAACA 

20 ATTTGGTAAA GGCTCCATCA TGCGCCTGGG TGAAGACCGT TCCATGGATG 
TGGAAACCAT CTCTACCGGT TCGCTTTCAC TGGATATCGC GCTTGGGGCA 
GGTGGTCTGC CGATGGGCCG TATCGTCGAA ATCTACGGAC CGGAATCTTC 
CGGTAAAACC ACGCTGACGC TGCAGGTGAT CGCCGCAGCG CAGCGTGAAG 
GTAAAACCTG TGCGTTTATC GATGCTGAAC ACGCGCTGGA CCCAATCTAC 

2 5 GCACGTAAAC TGGGCGTCGA TATCGACAAC CTGCTGTGCT CCCAGCCGGA 
CACCGGCGAG CAGGCACTGG AAATCTGTGA CGCCCTGGCG CGTTCTGGCG 
CAGTAGACGT TATCGTCGTT GACTCCGTGG CGGCACTGAC GCCGAAAGCG 
GAAATCGAAG GCGAAATCGG CGACTCTCAC ATGGGCCTTG CGGCACGTAT 
GATGAGCCAG GCGATGCGTA AGCTGGCGGG TAACCTGAAG CAGTCCAACA 

30 CGCTGCTGAT CTTCATCAAC CAGATCCGTA TGAAAATTGG TGTGATGTTC 
GGTAACCCGG AAACCACTAC CGGTGGTAAC GCGCTGAAAT TCTACGCCTC 
TGTTCGTCTC GACATCCGTC GTATCGGCGC GGTGAAAGAG GGCGAAAACG 
TGGTGGGTAG CGAAACCCGC GTGAAAGTGG TGAAGAACAA AATCGCTGCG 
CCGTTTAAAC AGGCTGAATT CCAGATCCTC TACGGCGAAG GTATCAACTT 

35 CTACGGCGAA CTGGTTGACC TGGGCGTAAA AGAGAAGCTG ATCGAGAAAG 
CAGGCGCGTG GTACAGCTAC AAAGGTGAGA AGATCGGTCA GGGTAAAGCG 
AATGCGACTG CCTGGCTGAA AGATAACCCG GAAACCGCGA AAGAGATCGA 
GAAGAAAGTA CGTGAGTTGC TGCTGAGCAA CCCGAACTCA ACGCCGGATT 
TCTCTGTAGA TGATAGCGAA GGCGTAGCAG AAACTAACGA AGATTTTTAA 

40 TCGTCTTGTT TGATACACAA GGGTCGCATC TGCGGCCCTT TTGCTTTTTT 
AAGTTGTAAG GATATGCCAT GACAGAATCA ACATCCCGTC G 



45 2) INFORMATION FOR SEQ ID NO: 919 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1391 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



50 



(ii) 



MOLECULE TYPE: DNA 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 919 



GGICCIGART CITMIGGIAA RAC 



23 



60 
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2) INFORMATION FOR SEQ ID NO: 92 0 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 0 
TCICCVATIT CICCITCIAI YTC 



2) INFORMATION FOR SEQ ID NO: 921 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 921 
TIYRTIGAYG CIGARCAIGC 



2) INFORMATION FOR SEQ ID NO: 922 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 922 
TARAAYTTIA RIGCIYKICC ICC 



2) INFORMATION FOR SEQ ID NO: 923 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 923 
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GACGCIGCCA TCCTGATGAT C 



PCT/CAOO/01150 

21 



5 2 ) INFORMATION FOR SEQ ID NO: 924 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 924 

ACCTCAGTCG TCACGTTGGC G 21 



20 



30 



35 



60 



2 ) INFORMATION FOR SEQ ID NO: 925 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 5 
AAGCAGATGG TTGTGTGCTG 20 

2 ) INFORMATION FOR SEQ ID NO: 926 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

4 5 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 6 
CAGCTGCTCG TGGTGCATCT CGAT 24 

50. 

2) INFORMATION FOR SEQ ID NO: 927 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 7 
5 ACGCGGAGAA GGTGCGCTT 19 



10 



2) INFORMATION FOR SEQ ID NO: 92 8 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 8 

20 

GGTCGTTCTT CGAGTCACCG CA 22 



25 2) INFORMATION FOR SEQ ID NO: 929 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 8 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides fragilis 

(B) STRAIN: ATCC 25285 



40 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92 9 



TTCAGCATGC CATTTCAAAA CAGGCCGAAG CCGATATCGT GATTATCGCT 50 

GCTTGTGGGG AGCGTGCAAA TGAAGTTGTG GAAATCTTTA CCGAATTTCC 100 

GGAATTGGTG GACCCGCACA CGGGACGTAA GCTGATGGAG CGTACCATTA 150 

TTATCGCAAA TACATCGAAC ATGCCGGTAG CAGCGCGTGA AGCTTCTGTG 200 

45 TATACGGCCA TGACGATTGC CGAATACTAT CGTGCCATGG GATTGAAAGT 250 

C C TGCTGATG GCAGACTCCA CTTCCCGTTG GGCGCAGGCA TTGCGTGAGA 3 00 

TGTCGAACCG TATGGAGGAG TTGCCCGGAC CGGATGCATT CCCGATGGAC 350 

CTGTCCTCAA TCATTTCTAA CTTCTATGGC CGTGCAGGCT ACGTGAAACT 4 00 

GAATAACGGC GAGAGCGGTT CTATTACCTT TATCGGTACA GTATCACC 448 

50 

2) INFORMATION FOR SEQ ID NO: 93 0 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides distasonis 
5 (B) STRAIN: ATCC 8503 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 930 

GCTATCTCTA AACAAGCGGA AGCGGATATC GTGATTATCG CCGCCTGCGG 50 

10 TGAGCGTGCG AATGAGGTCG TAGAGGTATT TACGGAGTTC CCGGAATTGG 100 

TAGACCCGCA TACGGGACGT AAATTGATGG AACGTACGAT CATTATCGCC 150 

AATACATCCA ACATGCCGGT AGCCGCTCGT GAGGCATCCG TATATACGGC 2 00 

GATGACCATC GCCGAGTATT ATCGCAGCAT GGGTTTGAAG GTTCTGTTGA 250 

TGGCCGACTC TACTTCCCGC TGGGCACAGG CTTTGCGTGA GATGTCCAAC 3 00 

15 CGTTTGGAGG AGTTGCCGGG ACCGGATGCT TTCCCGATGG ACTTGTCCGC 3 50 

TATCGTGGCG AACTTCTACG CTCGTGCGGG ATTCGTTCAT TTGAATAACA 4 00 

ACGCTACAGG CTCCGTCACT TTCATCGGTA CGGTATCG 43 8 



20 



30 



2) INFORMATION FOR SEQ ID NO: 931 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 453 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas asaccharolytlca 

(B) STRAIN: ATCC 252 60 

3 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 931 

CCTCCAGCAC GCTATCTCTA AGCAGGCGGA GGCTGATATC GTCATTATGG 50 

CAGCCTGCGG TGAGCGTGCT AATGAGGTGG TGGAGATCTT TGCCGAGTTC 100 

CCTGAGCTCG AAGACCCACA CACGGGACGC AAGCTGATGG AGCGTACGAT 150 

4 0 CATCATCGCT AACACGAGTA ACATGCCAGT GGCTGCTCGT GAGGCTTCGG 2 00 

TCTACACCGC TATGACCATC GCTGAGTACT ACCGCTCGAT GGGTCTCAAA 250 

GTACTCCTAA TGGCTGACTC GACCTCTCGC TGGGCACAGG CACTGCGTGA 3 00 

GATGTCTAAC CGTCTAGAGG AGCTGCCTGG ACCAGATGCA TTCCCGATGG 3 50 

ACTTGTCGGC TATCGTGGCA AACTTCTACG CTCGTGCCGG CTTCGTCTAT 4 00 

4 5 CTCAACAACG GTGAGACAGG TTCTGTAACC TTCATCGGTA CGGTCTCTCC 450 

AGC 453 



50 2) INFORMATION FOR SEQ ID NO: 932 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 83 5 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
60 (vi) ORIGINAL SOURCE: 

519 
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(A) 
(B) 
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ORGANISM: Listeria monocytogenes 
STRAIN: ATCC 15313 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 932 



CGGAGCTATC 
GTGAACATAT 
TTCATGAACA 
TGAAATGGAA 
ACATTCCTGT 
GACTGGGAAG 
TCCAACTCCW 
ATGTATTCTC 
CGTGGACAAG 
AGAAAGCAAA 
TAGACTACGC 
GCTCGTGAAG 
TACTCCACAC 
AAGGTGGACG 
TTCCGTACTA 
AATGGTAAYG 
CAATCGCTAT 



TTAGTAGTAT 
CTTACTTTCA 
AATGTGACAT 
ATTCGTGATC 
AATCAAAGGT 
CTAAAATTGA 
GAACGTGATA 
AATCACTGGT 
TTAAAGTTGG 
AAAGTAGTAG 
TGAAGCTGGC 
ATATCCAACR 
ACTAACTTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGATA 
CGAAGACGGT 



CTGCTGCTGA 
CGTCAAGTTG 
GGTTGACGAT 
TATTAACTGA 
TCAGCTCTTA 
CGAGTTAATG 
CTGACAAACC 
CGTGGAACAG 
TGACGAAGTA 
TAACTGGAGT 
GACAACATTG 
TGGTCAAGTA 
AAGCTGAAAC 
TTCTTCAACA 
TGGTATTGTT 
ACATTGAGCT 
ACTAAATTCT 



TGGCCCAATG 
GTGTTCCATA 
GAAGAATTAC 
ATATGAATTC 
AAGCACTTCA 
GAAGCTGTAG 
ATTCATGATG 
TTGCAACTGG 
GAAGTTATCG 
AGAAATGTTC 
GCGCACTTCT 
TTAGCTAAAC 
TTATGTTTTA 
ACTACCGCCC 
ACACTTCCAG 
TGCAGTTGAA 
CTATC 



CCACAAACTC 50 

CATCGTTGTA 100 

TAGAATTAGT 150 

CCTGGCGATG 2 00 

AGGTGAAGCT 250 

ATTCTTACAT 300 

CCAGTTGAGG 350 

ACGTGTTGAA 4 00 

GTATCGAAGA 450 

CGTAAATTAC 500 

ACGTGGTGTT 550 

CAGGTTCGAT 600 

ACTAAAGAAG 650 

ACAATTCTAT 700 

AAGGTACTGA 750 

CTAATTGCAC 800 

835 



2) INFORMATION FOR SEQ ID NO: 933 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 933 
CATCATCGTI TTCMTGAACA ARTG 



2) INFORMATION FOR SEQ ID NO: 934 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 934 
TCACGYTTRR TACCACGCAG I AG A 



2) INFORMATION FOR SEQ ID NO: 935 

(i) SEQUENCE CHARACTERISTICS: 

520 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 935 
10 GGIAARWSIC ARYT I TGYCA YAC 23 



15 



2 ) INFORMATION FOR SEQ ID NO: 936 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 936 

25 

TCISIYTCIG GIARRCAIGG 2 0 

30 2 ) INFORMATION FOR SEQ ID NO: 937 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
4 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 7 

ATIACIGARG YITTYGGIGA RTT 23 

45 2 ) INFORMATION FOR SEQ ID NO: 938 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 8 

CYIGTIGYIS WIGCRTGIGC 2 0 



60 
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2) INFORMATION FOR SEQ ID NO: 93 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 03 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Saccharomyces cerevisiae 
(C) ACCESSION NUMBER: D10023 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93 9 

ATGTCTCAAG TTCAAGAACA ACATATATCA GAGTCACAGC TTCAGTACGG 50 

GAACGGTTCG TTGATGTCCA CTGTACCAGC AGACCTTTCA CAGTCAGTCG 100 

TTGATGGAAA CGGCAACGGT AGCAGCGAAG ATATTGAGGC CACCAACGGC 150 

20 TCCGGCGATG GTGGCGGATT GCAGGAGCAA GCGGAAGCGC AAGGTGAAAT 2 00 

GGAGGATGAA GCATACGATG AAGCTGCCTT AGGTTCGTTT GTGCCAATAG 2 50 

AAAAACTGCA AGTGAACGGG ATTACTATGG CGGATGTGAA AAAACTAAGG 3 00 

GAGAGTGGGC TTCACACTGC TGAAGCGGTA GCATATGCTC CCAGAAAGGA 3 50 

TTTATTGGAA ATCAAAGGTA TATCGGAAGC TAAGGCAGAT AAGTTGCTAA 4 00 

2 5 ACGAAGCGGC AAGGCTAGTG CCTATGGGAT TTGTCACGGC TGCTGATTTT 4 50 

CATATGAGAA GATCGGAGCT GATTTGTTTG ACAACGGGTT CTAAGAATTT 5 00 
GGACACTCTT TTGGGTGGTG GTGTGGAAAC TGGTTCTATT ACTGAGCTTT " 550 

TCGGTGAATT CAGGACAGGT AAGTCCCAGC TATGTCACAC TTTGGCCGTG 600 

ACATGCCAAA TTCCATTGGA TATTGGTGGC GGTGAAGGTA AGTGTTTGTA 650 

3 0 TATCGATACC GAAGGTACTT TCAGGCCGGT AAGATTGGTA TCCATAGCTC 700 

AGCGGTTCGG ATTAGACCCG GATGATGCTT TGAACAACGT TGCGTATGCA 750 

AGAGCCTATA ACGCCGATCA TCAGTTAAGA CTTCTGGATG CTGCTGCCCA 8 00 

AATGATGAGC GAGTCTCGGT TTTCCTTGAT TGTGGTCGAT TCTGTTATGG 850 

CTCTATACCG TACGGATTTT TCTGGTCGTG GTGAACTAAG CGCAAGGCAA 900 

3 5 ATGCATTTAG CCAAATTTAT GCGTGCTTTG CAAAGGCTGG CCGACCAATT 950 

TGGTGTTGCA GTCGTCGTTA CTAACCAAGT GGTCGCCCAA GTTGATGGTG 1000 

GTATGGCTTT TAATCCAGAT CCAAAGAAGC CTATCGGTGG TAATATTATG 1050 

GCACATTCTT CCACCACGCG ATTAGGTTTC AAAAAGGGTA AGGGATGTCA 110 0 

AAGATTATGC AAAGTTGTTG ACTCACCTTG CTTACCAGAG GCTGAATGTG 1150 

4 0 TGTTCGCGAT CTATGAAGAT GGTGTTGGTG ACCCCAGAGA AGAAGACGAG 12 00 

TAG 12 03 



4 5 2) INFORMATION FOR SEQ ID NO: 94 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1800 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 

(B) STRAIN: GRF88 

(C) ACCESSION NUMBER: M87549 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 940 
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CGATCCAATT GCTGGTCTTA AGATGCATTT GATTGATCTA GGTATTGCCi£ 50 
CTGAAGCTGA AGTCAAAGCT TACGACAAGT CCGCTAGAAA ATACGTTGAC 100 
GAACAAGTTG AATTAGCTGA TGCTGCTCCT CCTCCAGAAG CCAAATTATC 150 
5 CATCTTGTTT GAAGACGTCT ACGTGAAAGG TACAGAAACT CCAACCCTAA 2 00 

GAGGTAGGAT CCCTGAAGAT ACTTGGGACT TCAAAAAGCA AGGTTTTGCC 250 
TCTAGGGATT AATTAAATCG TAAGGAAAAA TAAAATAATA GTGCTGTGAT 3 00 

CGCATGATAT TCTTCCCTGG AAGCGCCATT TTATAGCAAG AAATGTAAGT 350 
CAAGTATATT TTAACTGTAT ATACAACAAT ATGACTCTTT TTTATGCCTT 4 00 

10 GTTGTTTTTC TTCGGGTTTT CCCACACATT GTGTGGAGAG ATAGTTATTA 4 50 

ACAGACCGAA AATAGCCGCC CAAGGATAAA CTTTTATATA AAGGGAAGGG 500 
TAGTTGACCC AAAAATTTGG ATTCTACTTT CCAGATTTAC TTTCACCCTT 550 
TTATATTTGC TGTAGTCTGT TATGCCAATC AGGAAAGCAT TTGAACAAAT 600 
ATGTCTGTTA CAGGAACTGA GATCGATAGT GATACAGCAA AAAATATTCT 650 

15 TAGTGTAGAT GAATTACAGA ACTATGGTAT TAATGCCTCA GATCTTCAAA 700 
AATTGAAGTC TGGTGGGATA TACACAGTCA ATGTATGTTA TAATAACATT 750 
TTTAAAACCT CTGCTGTAGA GGTTCTTTCC CCCTTTCTTT TACTAACTAA 8 00 

TAATTTGGAA AGGAACTTTT ATAGACCGTT TTGTCAACAA CAAGAAGACA 850 
TCTATGTAAA ATTAAAGGGT TAAGTGAGGT GAAAGTGGAA AAAATTAAAG 900 

2 0 AAGCTGCTGG AAAAATCATA CAGGTGGGAT TTATCCCCGC TACTGTACAA 950 

TTAGATATAA GACAGCGTGT GTACTCCTTA TCAACTGGAT CTAAGCAACT 1000 

AGATTCAATC CTAGGTGGTG GAATAATGAC AATGAGTATC ACTGAAGTAT 1050 

TTGGTGAATT TAGGTGTGGT AAGACACAGA TGTCTCATAC TTTGTGTGTT 110 0 

ACCACGCAGC TTCCGAGGGA AATGGGTGGT GGTGAAGGGA AAGTAGCATA 1150 

25 TATTGATACA GAAGGCACTT TCAGGCCCGA GAGGATTAAG CAAATTGCAG 12 00 

AAGGTTATGA ATTGGATCCC GAGTCATGTT TGGCAAACGT TTCATATGCT 1250 

AGAGCCTTGA ATAGTGAACA TCAAATGGAA CTTGTTGAAC AATTGGGTGA 13 00 

AGAACTTAGT TCTGGAGATT ATCGCCTTAT CGTGGTAGAT TCTATAATGG 13 50 

CAAACTTCAG AGTAGACTAC TGCGGTAGAG GTGAACTAAG CGAAAGACAG 14 00 

30 CAAAAGCTAA ATCAACATCT TTTCAAATTG AATAGATTGG CAGAGGAATT 1450 

TAATGTTGCA GTATTTCTGA CAAACCAAGT TCAATCAGAC CCAGGTGCTT 1500 

CTGCATTATT TGCCTCGGCA GATGGTAGGA AACCAATTGG AGGGCACGTT 1550 

CTGGCACATG CGTCAGCAAC AAGGATTTTG TTGAGAAAAG GGCGTGGTGA 1600 

CGAAAGAGTT GCCAAGTTAC AAGATTCCCC AGATATGCCT GAAAAAGAAT 1650 

35 GTGTCTACGT AATTGGTGAA AAAGGTATTA CCGATTCAAG TGACTAGTTT 1700 

TTGTATACTT TTTTAATGAA GATGACATTG CTCCTTTATT AAACTTTTCT 1750 

TTTACTTTGT GTTACTAATA TTATTAATAT CTTGTTATGA TTCTTTGTTT 18 00 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus humicolus 

(B) STRAIN: ATCC 38294 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 941 

CGTCCTTATC CAGGAGCTCA TTAACAACAT TGCCAAGGCC CACGGTGGTT 50 

TCTCCGTCTT CACCGGTGTC GGTGAGCGTA CCCGTGAGGG TAACGACCTG 10 0 

TACCACGAGA TGCGTGAGAC TGGTGTCATC AACCTCGAGG GCGACTCCAA 150 

2 0 GGTCGCTCTC GTCTTCGGCC AGATGAACGA GCCCCCCGGA GCCCGTGCCC 2 00 

GTGTCGCCCT TACCGGCCTC ACCATCGCCG AGTACTTCCG TGACGAGGAG 2 50 

GGTCAGGACG TGCTTCTCTT CATCGACAAC ATTTTCCGTT TCACCCAGGC 3 00 

CGGTTCCGAG GTGTCTGCCC TTCTCGGTCG TATCCCCTCG GCCGTCGGTT 3 50 

ACCAGCCCAC CCTCGCTACC GACATGGGTT CCATGCAGGA GCGTATCACC 400 

2 5 ACCACCAAGA AGGGTTCGAT TACCTCCGTC 43 0 



2) INFORMATION FOR SEQ ID NO: 942 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

3 5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

4 0 (A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 43895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 942 

4 5 CGTGTACGAT GCTCTTGAGG TGCAAAATGG TAATGAGCGT CTGGTGCTGG 50 

AAGTTCAGCA GCAGCTCGGC GGCGGTATCG TGCGTACCAT CGCAATGGGT 100 

TCCTCCGACG GTCTGCGTCG CGGTCTGGAT GTAAAAGACC TCGAACACCC 150 

GATCGAAGTC CCGGTAGGTA AAGCGACTCT GGGCCGTATC ATGAACGTAC 2 00 

TGGGTGAACC GGTCGACATG AAAGGCGAGA TCGGTGAAGA AGAG CGTTGG 2 50 

50 GCGATTCACC GCGCAGCACC TTCCTACGAA GAGCTGTCAA ACTCTCAGGA 300 

ACTGCTGGAA ACCGGTATCA AAGTTATCGA CCTGATGTGT CCGTTCGCTA 3 50 

AGGGCGGTAA AGTTGGTCTG TTCGGTGGTG CGGGTGTAGG TAAAACCGTA 400 

AACATGATGG AGCTCATTCG TAACATCGCG ATCGAGCACT CCGGTTACTC 450 

TGTGTTTGCG GGCGTAGGTG AACGTACTCG TGAGGGGAAC GACTTCTACC 50 0 

55 ACGAAATGAC CGACTCCAAC GTTATCGATA AAGTATCCCT GGTGTATGGC 550 

CAGATGAACG AGCCGCCGGG AAACCGTCTG CGCGTAGCTC TGACCGGTCT 6 00 

GACCATGGCT GAGAAATTCC GTGACGAAGG TCGTGACGTT CTGCTGTTCG 650 

TTGACAACAT CTATCGTTAC ACCCTGGCCG GTACGGAAGT ATCCGCACTG 7 00 

CTGGGCCGTA TGCCTTCAGC GGTAGGTTAT CAGCCGACCC TGGCGGAAGA 750 

60 GATGGGCGTT CTGCAGGAAC GTATCACCTC CACCAAAACC GGTT 7 94 
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2) INFORMATION FOR SEQ ID NO: 943 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 3 5401 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 943 

20 ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTACGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATTGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 2 00 

ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 2 50 

2 5 AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 3 00 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 3 50 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 400 

AGGTAAAACC GTAAACATGA TGGAGCTCAT TCGTAACATC GCGATCGAGC 450 

ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 500 

3 0 AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CCTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTTCTGCTGT TCGTTGACAA CATCTATCGT TACACCCTGG CCGGTACGGA 700 

AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 750 

3 5 CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 8 00 

ACTGGTTCTA TCAC 814 



40 2) INFORMATION FOR SEQ ID NO: 944 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 11775 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 944 



GTGTACGATG CTCTTGAGGT GCAAAATGGT AATGAGCGTC TGGTGCTGGA 50 

AGTTCAGCAG CAGCTCGGCG GCGGTATCGT GCGTACCATC GCAATGGGTT 100 

CCTCCGACGG TCTGCGTCGC GGTCTGGATG TAAAAGACCT CGAACACCCG 150 

ATCGAAGTCC CGGTAGGTAA AGCGACTCTG GGCCGTATCA TGAACGTACT 200 

60 GGGTGAACCG GTCGACATGA AAGGCGAGAT CGGTGAAGAA GAGCGTTGGG 250 
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CGATTCACCG CGCAGCACCT TCCTACGAAG 
CTGCTGGAAA CCGGTATCAA AGTTATCGAC 
GGGCGGTAAA GTTGGTCTGT TCGGTGGTGC 
ACATGATGGA GCTTATTCGT AACATCGCGA 
5 GTGTTTGCGG GCGTAGGTGA ACGTACTCGT 
CGAAATGACC GACTCCAACG TTATCGACAA 
AGATGAACGA GCCGCCGGGA AACCGTCTGC 
ACCATGGCTG AGAAATTCCG TGACGAAGGT 
TGACAACATC TATCGTTACA CCCTGGCCGG 
10 TGGGCCGTAT GCCTTCAGCG GTAGGTTATC 
ATGGGCGTTC TGCAGGAACG TATCACCTCC 



PCT/CA00/01150 

AGCTGTCAAA CTCTC^GGAA 3 0/0 

CTGATGTGTC CGTTCGCTAA 3 50 

GGGTGTAGGT AAAACCGTAA 4 00 

TCGAGCACTC CGGTTACTCT 4 50 

GAGGGTAACG ACTTCTACCA 500 

AGTATCCCTG GTGTATGGCC 550 

GCGTTGCTCT GACCGGTCTG 600 

CGTGACGTTC TGCTGTTCGT 650 

TACGGAAGTA TCCGCACTGC 700 

AGCCGACCCT GGCGGAAGAG 750 

ACCAAAACCG GTTCTATC 798 



15 2) INFORMATION FOR SEQ ID NO: 945 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 5 



ATGCCGTACC GCGCGTGTAC GATGCTCTTG AGGTGCAAAA TGGTAATGAG 50 

CGTCTGGTGC TGGAAGTTCA GCAGCAGCTC GGCGGCGGTA TCGTGCGTAC 100 

CATCGCAATG GGTTCCTCCG ACGGTCTGCG TCGCGGTCTG GATGTAAAAG 150 

ACCTCGAACA CCCGATCGAA GTCCCGGTAG GTAAAGCGAC TCTGGGCCGT 2 00 

3 5 ATCATGAACG TACTGGGTGA ACCGGTCGAC ATGAAAGGCG AGATCGGTGA 250 

AGAAGAGCGT TGGGCGATTC ACCGCGCAGC ACCTTCCTAC GAAGAGCTGT 3 00 

CAAACTCTCA GGAACTGCTG GAAACCGGTA TCAAAGTTAT CGACCTGATG 350 

TGTCCGTTCG CTAAGGGCGG TAAAGTTGGT CTGTTCGGTG GTGCGGGTGT 4 00 

AGGTAAAACC GTAAACATGA TGGAGCTTAT TCGTAACATC GCGATCGAGC 4 50 

4 0 ACTCCGGTTA CTCTGTGTTT GCGGGCGTAG GTGAACGTAC TCGTGAGGGT 5 00 

AACGACTTCT ACCACGAAAT GACCGACTCC AACGTTATCG ACAAAGTATC 550 

CTTGGTGTAT GGCCAGATGA ACGAGCCGCC GGGAAACCGT CTGCGCGTTG 600 

CTCTGACCGG TCTGACCATG GCTGAGAAAT TCCGTGACGA AGGTCGTGAC 650 

GTATTGCTGT TCGTCGATAA CATCTATCGT TACACCCTGG CCGGTACCGA 7 00 

45 AGTATCCGCA CTGCTGGGCC GTATGCCTTC AGCGGTAGGT TATCAGCCGA 7 50 

CCCTGGCGGA AGAGATGGGC GTTCTGCAGG AACGTATCAC CTCCACCAAA 8 00 

ACCGGTTCTA TC 812 



50 

2) INFORMATION FOR SEQ ID NO: 946 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 832 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 94 6 



GCGACGCTAT 
CTGACTCTGG 
TGCAATGGGT 

10 CTGGTGCGCC 
GTCGACGTAT 
CAAGAGCCGT 
CAACTACCGA 
CCGTTTGCTA 

15 CAAAACCGTG 
GCGGTCTGTC 
GACTTCTACC 
GGTTTACGGT 
TGACCGGTTT 

2 0 GGTAAAGGTC 
TCTGGCCGGT 
TGGGTTACCA 
ATTACCTCTA 



CCCGCATGTT 
AAGTTCAACA 
AGTTCAGACG 
AATCACTGTG 
TGGGTACGCC 
GCCATTCACC 
ATTGTTGGAA 
AAGGCGGTAA 
AACATGATGG 
CGTGTTCGCA 
ACGAGATGAA 
CAGATGAACG 
GACCATGGCG 
GCGACGTATT 
ACCGAAGTAT 
ACCGACATTG 
CCCAAACCGG 



TACGATGCCC 
ACTTCTGGGT 
GCCTGAAACG 
CCGGTAGGTA 
TGTTGATGAA 
AAACTGCTCC 
ACCGGTATTA 
AGTAGGTCTG 
AATTGATCAA 
GGTGTGGGCG 
AGATTCCAAC 
AACCTCCGGG 
GAATACTTCC 
GTTCTTCGTT 
CTGCACTGTT 
GCTGAAGAAA 
TTCCATTACT 



TGAAATTGGA 
GACGGCGTTG 
CGGCATGTCT 
AAGGTACTTT 
GCAGGTCCGA 
GAAATTCGAC 
AAGTGATCGA 
TTCGGTGGTG 
CAACATCGCC 
AGCGTACCCG 
GTATTGGATA 
CAACCGTTTG 
GTGACGAAAA 
GACAACATCT 
GGGCCGTATG 
TGGGTCGTTT 
TC 



CGAGAACGGT 50 

TCCGTACTAT 100 

GTAAGCAATA 150 

GGGTCGTATT 2 00 

TCGATACCGA 2 50 

GAGTTGTCTG 3 00 

CTTGCTGTGT 3 50 

CCGGTGTAGG 4 00 

AAAGCGCACA 4 50 

TGAAGGTAAC 500 

AAGTGGCAAT 550 

CGCGTCGCAT 600 

AGACGAAAAC 650 

ACCGTTACAC 700 

CCTTCTGCAG 750 

GCAAGAGCGT 8 00 

832 



25 



30 



35 



40 



45 



50 



55 



60 



2) INFORMATION FOR SEQ ID NO: 947 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria sicca 

(B) STRAIN: ATCC 9913 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 7 



TCCGCGCGAT 
ACGGCCTGAC 
ACTATTGCAA 
CAATACAGAT 
GTATTGTCGA 
ACCGACAAAC 
GTCTGCTACT 
TGTGTCCGTT 
GTAGGCAAAA 
GCATAGTGGT 
GTAACGACTT 
GCGATGGTTT 
AGCCTTGACC 
AAAGCGGCAA 
TACACTCTGG 
AGCAGTAGGT 
AGCGTATTAC 



GCCATTCCGC 
TTTGGAAGTA 
TGGGTAGTTC 
GCGCCGATTA 
TGTGTTGGGT 
ACCGTGCTAT 
ACCGAGCTGC 
TGCCAAAGGC 
CCGTCAACAT 
TTGTCCGTGT 
CTACCACGAG 
ACGGTCAGAT 
GGTTTGACGA 
AGGTCGCGAC 
CCGGTACAGA 
TACCAACCGA 
CCTCTACTCA 



ATGTTTACGA 
CAACAGCTTC 
GGACGGTCTG 
CTGTGCCGGT 
ACACCTGTTG 
CCATCAGACA 
TGGAAACAGG 
GGTAAAGTAG 
GATGGAATTG 
TCGCCGGTGT 
ATGAAAGATT 
GAACGAACCT 
TGGCCGAATA 
GTATTGTTCT 
AGTATCCGCA 
CATTGGCTGA 
AACAGGCTCC 



CGCCCTGAAA 
TGGGCGACGG 
AAACGCGGCA 
AGGTAAAGGT 
ATGAAGCAGG 
GCTCCGAAAT 
CATTAAAGTG 
GTCTGTTCGG 
ATTAACAACA 
GGGGGAACGT 
CCAACGTATT 
CCGGGTAACC 
CTTCCGTGAT 
TCGTGGACAA 
TTGCTCGGTC 
AGAAATGGGT 
ATTACTTCTA 



TTGGATGCAA 50 

CGTGGTTCGT 100 

TGACTGTAAG 150 

ACTTTGGGAC 200 

TCCGATTGAT 250 

TCGATGAGTT 300 

ATTGACTTGC 350 

TGGTGCCGGT 400 

TCGCCAAAGC 450 

ACCCGTGAAG 500 

GGACAAAGTG 550 

GTCTGCGTGT 600 

GAAAAAGACG 650 

CATTTACCGT 700 

GTATGCCTTC 750 

CGTCTGCAAG 800 

840 
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2) INFORMATION FOR SEQ ID NO: 948 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 
15 (B) STRAIN: ATCC 903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 8 

GCAGCTGGCG ACAAGCTACC TGAGATCAAT AATGCACTTG TAGTCTATAA 50 

2 0 AAATGACGAA AAAAAATCAA AAATCGTCCT TGAAGTAGCT CTTGAGCTTG 100 

GTGATGGAGT GGTTCGGACC ATCGCTATGG AATCAACGGA TGGGTTGACT 150 

CGTGGCATGG AAGTGCTAGA TACTGGCCGT CCAATTTCTG TGCCAGTCGG 2 00 

CAAAGAAACA CTTGGTCGCG TCTTTAACGT TTTGGGAGAT ACCATTGACT 2 50 

TGGATGCTCC TTTTGCGGAT GATGCAGAGC GCCAGCCAAT CCATAAGAAA 3 00 

25 GCTCCAACCT TTGATGAGTT GTCTACTTCT TCAGAGATCT TAGAGACAGG 3 50 

TATCAAGGTT ATCGACCTGT TAGCCCCTTA TCTGAAAGGT GGTAAAGTTG 4 00 

GACTCTTCGG TGGTGCCGGA GTTGGTAAGA CCGTCCTGAT TCAAGAATTG 4 50 

ATCCACAACA TTGCCCAAGA ACACGGTGGT ATTTCTGTAT TTACTGGCGT 500 

TGGGGAACGT ACCCGTGAAG GGAATGACCT TTATTGGGAA ATGAAAGAGT 550 

3 0 CTGGTGTTAT CGAGAAAACA GCCATGGTCT TCGGTCAGAT GAATGAGCCG 600 

CCAGGAGCGC GTATGCGGGT TGCTTTGACT GGTTTGACGA TTGCAGAATA 650 

CTTCCGTGAT GTGGAAGGTC AAGATGTCTT GCTCTTCATT GACAACATGT 700 

TCCGTTTCAC GCAGGCAGGT TCTGAAGTTT CTGCCCTTTT GGGTCGGATG 750 

CCGTCAGCCG TTGGTTACCA ACCAACACTT GCGACAGAAA TGGGGCAATT 800 

3 5 GCAAGAGCGT ATCACATCGA CTAAGAAGGG TTCTGTAACC TCT 843 



40 



2) INFORMATION FOR SEQ ID NO: 94 9 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 841 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 4 9456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 9 

55 GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTGG 100 

GAGATGGTAT GGTCCGTACT ATCGCCATGG AATCAACAGA TGGTTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 2 00 

TAAAGAAACT TTGGGACGTG TCTTCAATGT TTTGGGAGAT ACCATTGACT 2 50 

60 TGGAAGCTCC TTTTACAGAA GATGCAGAGC GTCAGCCAAT TCATAAAAAA 300 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GCTCCAACTT TTGATGAATT GTCTACCTCT 
GATTAAGGTT ATCGACCTTC TTGCCCCTTA 
GACTTTTCGG TGGTGCCGGA GTTGGTAAAA 
ATTCACAACA TTGCCCAAGA ACACGGTGGT 
5 TGGGGAACGT ACTCGTGAGG GTAATGACCT 
CAGGTGTTAT CGAGAAAACA GCCATGGTAT 
CCAGGAGCAC GTATGCGTGT TGCCCTAACT 
CTTCCGTGAT GTGGAAGGCC AAGACGTGCT 
TCCGTTTCAC TCAGGCTGGT TCAGAAGTAT 
10 CCATCAGCCG TTGGTTACCA ACCAACACTT 
GCAAGAGCGT ATTACATCAA CTAAAAAGGG 



PCT/CA00/01150 

TCTGAAATCC TTGAAACAGG 3 50 

CCTTAAAGGT GGTAAGGTTG 4 00 

CCGTCTTAAT CCAAGAATTG 4 50 

ATTTCAGTAT TTACCGGTGT 500 

TTACTGGGAA ATGAAAGAAT 550 

TTGGTCAGAT GAATGAGCCG 6 00 

GGTTTGACAA TCGCCGAATA 650 

TCTCTTTATC GATAATATCT 700 

CTGCCCTTTT GGGTCGTATG 750 

GCTACGGAAA TGGGTCAATT 800 

TTCTGTAACC T 841 



15 2) INFORMATION FOR SEQ ID NO: 950 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 7 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 950 



GCTACCTGAG ATCAATAATG CACTTGTAGT CTATAAAAAT GACGAAAATA 50 

AATCAAAAAT CGTCCTTGAA GTAGCTCTTG AGCTTGGTGA TGGAGTGGTT 100 

CGGACCATCG CTATGGAATC AACGGATGGG TTGACTCGTG GCATGGAAGT 150 

GCTAGATACT GGTCGTCCAA TTTCTGTGCC AGTCGGCAAA GAAACACTTG 2 00 

3 5 GTCGCGTCTT TAACGTTTTG GGAGATACCA TTGACTTGGA TGCTCCTTTT 250 

GCGGATGATG CAGAGCGCCA GCCAATCCAT AAGAAAGCTC CAACCTTTGA 3 00 

TGAGTTGTCT ACTTCATCAG AGATCTTAGA GACAGGTATC AAGGTTATCG 350 

ACCTGTTAGC ACCTTATCTG AAAGGTGGTA AAGTCGGACT CTTCGGTGGT 400 

GCCGGAGTTG GTAAGACCGT CCTGATTCAG GAATTGATCC ACAACATTGC 450 

4 0 CCAAGAGCAT GGTGGTATTT CCGTGTTTAC CGGTGTTGGG GAACGTACCC 500 

GTGAAGGGAA TGACCTTTAC TGGGAAATGA AGGAGTCTGG CGTTATCGAG 550 

AAAACAGCCA TGGTCTTCGG TCAGATGAAT GAGCCACCAG GAGCGCGTAT 600 

GCGGGTTGCT TTGACTGGTT TGACGATTGC AGAGTACTTC CGTGATGTAG 650 

AAGGTCAAGA TGTCTTGCTC TTCATTGACA ACATCTTCCG TTTCACGCAG 700 

45 GCAGGTTCTG AAGTCTCTGC CCTTTTGGGT CGGATGCCAT CAGCCGTTGG 750 

TTACCAACCA ACACTTGCGA CTGAAATGGG ACAACTCCAA GAGCGTATTA 800 

CATCGACTAA GAAAGGTTCT GTAACTT 827 



50 

2) INFORMATION FOR SEQ ID NO: 951 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 844 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: Genomic DNA 

529 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 951 

GCAGCAGGGG AAACACTTCC TGAGATTAAT AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTGG 100 

GTGATGGTAT GGTC CGTACG ATCGCCATGG AATCAACAGA TGGTTTGACT 150 

10 CGTGGAATGG AAGTTTTGGA CACAGGCCGT CCAATCTCTG TACCAGTAGG 2 00 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACTATTGACT 250 

TGGATGCTCC TTTCGCTGAA GACGCTGAGC GTCAGCCAAT TCATAAGAAA 3 00 

GCTCCAACTT TTGATGAATT GTCTACCTCA TCTGAAATCT TGGAAACAGG 3 50 

GATTAAGGTT ATCGACCTTC TTGCCCCTTA CCTTAAAGGT GGGAAAGTTG 4 00 

15 GACTCTTCGG TGGTGCCGGA GTTGGTAAAA CTGTCTTGAT CCAAGAGTTG 450 

ATTCACAACA TTGCCCAAGA ACATGGTGGT ATTTCAGTAT TTACCGGTGT 500 

TGGAGAACGT ACCCGTGAGG GGAACGACCT TTACTGGGAA ATGAAAGAAT 550 

CAGGCGTTAT CGAGAAAACA GCCATGGTAT TTGGTCAGAT GAATGAGCCA 600 

CCTGGAGCAC GTATGCGTGT TGCTCTTACT GGTTTGACAA TCGCCGAATA 650 

20 CTTCCGTGAT GTAGAAGGCC AAGATGTGCT TCTCTTTATC GACAATATCT 700 

TCCGTTTCAC TCAAGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGGATG 750 

CCTTCAGCCG TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CTAAGAAGGG TTCTGTAACC TCTA 844 



25 



40 



60 



2) INFORMATION FOR SEQ ID NO: 952 



(i) SEQUENCE CHARACTERISTICS: 
3 0 (A) LENGTH: 83 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 952 



GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG 100 

45 GAGATGGTAT GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 2 00 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT 2 50 

TGGAAGCTCC TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA 3 00 

GCTCCAACTT TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG 3 50 

50 GATCAAGGTT ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG 4 00 

GACTTTTCGG TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG 4 50 

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT 500 

TGGGG AACGT ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CAGGCGTTAT CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA 600 

55 CCAGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA 650 

CTTCCGTGAT GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT 7 00 

TCCGTTTCAC TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG 750 

CCATCAGCCG TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT 800 

GCAAGAACGT ATCACATCAA CCAAGAAGGG 83 0 
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2) INFORMATION FOR SEQ ID NO: 953 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
15 (B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 953 

AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA AAATGACGAA 50 

2 0 AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG GAGATGGTAT 100 

GGTTCGTACT ATCGCCATGG AAT CAACAGA TGGGTTGACT CGTGGAATGG 150 

AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG TAAAGAAACT 2 00 

TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT TGGAAGCTCC 250 

TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA GCTCCAACTT 3 00 

25 TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG GATCAAGGTT 3 50 

ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 4 00 

TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG ATTCACAACA 4 50 

TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT TGGGGAACGT 50 0 

ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

3 0 CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA CTTCCGTGAT 650 

GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT TCCGTTTCAC 7 00 

TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG CCATCAGCCG 750 

TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT GCAAGAACGT 8 00 

3 5 ATCACATCAA CTAAGAAGGG TTC 823 



2) INFORMATION FOR SEQ ID NO: 954 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 954 

55 GCAGCAGGGG AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA 50 

AAATGACGAA AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG 100 

GAGATGGTAT GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT 150 

CGTGGAATGG AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG 2 00 

TAAAGAAACT TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGACT 250 

60 TGGAAGCTCC TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA 3 00 
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GCTCCAACTT TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG 3 50 

GATCAAGGTT ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG 4 00 

GACTTTTCGG TGGTGCCGGA GTTGGTAAAA CTGTCTTAAT CCAAGAATTG 4 50 

ATTCACAACA TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTGCTGGTGT 500 

5 TGGGGAACGT ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT 550 

CAGGCGTTAT CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA 60 0 

CCAGGAGCAC GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA 65 0 

CTTCCGTGAT GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT 700 

TCCGTTTCAC TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG 750 

10 CCATCAGCCG TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT 8 00 

GCAAGAACGT ATCACATCAA CCAAGAAGGG TTCTGTAACC TCTA 844 



15 2) INFORMATION FOR SEQ ID NO: 955 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 834 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 955 



AAAAACTTCC TGAGATTAAC AATGCACTTG TCGTCTACAA AAATGACGAA 50 

AGAAAAACAA AAATCGTCCT TGAAGTAGCC TTGGAGTTAG GAGATGGTAT 100 

GGTTCGTACT ATCGCCATGG AATCAACAGA TGGGTTGACT CGTGGAATGG 150 

AAGTATTGGA CACAGGTCGT CCAATCTCTG TACCAGTAGG TAAAGAAACT 2 00 

3 5 TTGGGACGTG TCTTCAACGT TTTGGGAGAT ACCATTGATT TGGAAGCTCC 2 50 

TTTTACAGAA GACGCAGAGC GTCAGCCAAT TCATAAAAAA GCTCCAACTT 3 00 

TTGATGAGTT GTCTACCTCT TCTGAAATCC TTGAAACAGG GATCAAGGTT 3 50 

ATTGACCTTC TTGCCCCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 40 0 

TGGTGCCGGA GTTGGTAAAA CCGTCTTAAT CCAAGAATTG ATTCACAACA 450 

40 TTGCCCAAGA GCACGGTGGT ATTTCAGTAT TTACTGGTGT TGGGGAACGT 500 

ACTCGTGAGG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

CGAGAAAACA GCCATGGTCT TTGGTCAGAT GAATGAGCCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TCGCTGAATA CTTCCGTGAT 650 

GTGGAAGGCC AAGACGTGCT TCTCTTTATC GATAATATCT TCCGTTTCAC 700 

45 TCAGGCTGGT TCAGAAGTAT CTGCCCTTTT GGGTCGTATG CCATCAGCCG 750 

TTGGTTACCA ACCAACACTT GCTACGGAAA TGGGTCAATT GCAAGAACGT 8 00 

ATCACATCAA CCAAGAAGGG TTCTGTAACC TCTA 834 



50 

2) INFORMATION FOR SEQ ID NO: 956 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 95 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Babesia microti 

(B) STRAIN: Persing-1 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 956 

TTGTATATCA CAGGCACTCA GCAAATATTC CGATAC TG AC GTAATTATAT 5 0 

ACGTGGGTTG TGGTGAACGT GGGAATGAAA TGGCTGAGAT TCTATGCGAA 100 

TTCCCTGAAC TATCTACTGT AGTTAATGAT GAAAAGGTGG CCATTATGGA 150 

10 ACGTACATGC TTAGTTGCCA ATACTTCTAA TATGCCAGTG GCCGCTAGAG 2 00 

AAGCTAGTAT ATACACTGGT ATTACAATTG CTGAATATTT CCGTGATATG 2 50 

GGTTACAACT GCACTCTTAT GGCCGATTCC ACTAGCCGAT GGGCAGAGGC 3 00 

TCTAAGGGAA ATTTCTGGTA GATTGGCTGA AATGCCTGCA GATTCTGGCT 3 50 

ATCCGGCCTA TTTATCGTCA AGGTTGTCAG CTTTTTATGA ACGTGCAGGT 4 00 

15 GGGATAACTG TCTAATTAAT TTAGGCTTGA TTAAGTGCTT AGGTTCACCA 4 50 

ACACGAACCG GATCTATTAC GGTTGTAGGA GCAGTTTCTC CACCA 4 95 



20 2) INFORMATION FOR SEQ ID NO: 957 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: En t amoeba histolytica 

(B) STRAIN: HM1-1MSS 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 957 



AGTTATTTCA CAAGCATTAA GTAAATATAG TAATTCAGAT GTTATTATTT 50 

ATGTAGGATG TGGTGAACGA GGAAATGAAA TGGCAGAAGT TCTTCGAGAT 100 

TTTCCAGCTC TTTCTATTAA AGTAGGAGAT AAAGAAGAAT CTATTATGAC 150 

AAGAACAGCA CTTGTTGCTA ATACATCTAA TATGCCTGTT GCAGCACGTG 2 00 

4 0 AAGCATCAAT TTATACTGGA ATTACATTAT CAGAATATTA TAGAGATATG 2 50 

GGATATAATG TTGCTATGAT GGCAGATTCA ACATCAAGAT GGGCTGAAGC 3 00 

ACTTAGAGAA ATTTCAGGAC GTCTTGCAGA AATGCCAGCT GATTCTGGAT 3 50 

ATCCAGCATA TCTTGCAGCA CGTTTAGCAT CATTTTATGA ACGTGCAGGT 4 00 

ATGGTTGAAT GTTTAGGATC ACCAAAAAGA ATAGGGTCAG TTTCTATTGT 450 

45 AGGAGCTGTT TCACCACCT 4 69 



50 



2) INFORMATION FOR SEQ ID NO: 958 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 52 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Fusobacterium nucleatum subsp . polymorphism 
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(xi) SEQUENCE DESCR! 

5 TACAACACCA ACTTGCTAAA 
GGTTGTGGGG AACGTGGAAA 
AGAAATTATT GACCCTAAGA 
TTATAGCTAA TACTTCTAAT 
TATACTGGTA TAACTATTGC 
10 GGCACTTATG GCAGATTCAA 
TGTCAGGACG TTTGGAAGAA 
CTATCAAGTA GAATAGCAGA 
TCTAGGTAAT GGAGAAGAAG 
CA 

15 



PTION: SEQ ID NO: 958 

TGGGCAGATG CAGAAGTAGT 
TGAAATGACC GATGTACTTA 
CAGGACAATC TTTAATGAAG 
ATGCCAGTTG CTGCTCGTGA 
AGAATATTTT AGAGATATGG 
CAAGTCGTTG GGCAGAAGCA 
ATGCCAGGTG ATGAAGGATA 
GTTTTATGAA AGAGCAGGGC 
GAGCATTAAC TGTAATTGGA 



TGTTTATGTT 50 

TGGAATTCCC ■ 100 

AGAACAGTTC 150 

GGCTTCAATC 2 00 

GATATTCAGT 250 

CTTCGTGAAA 3 00 

TCCAGCATAT 350 

TTGTTGAATG 4 00 

GCAGTATCTC 4 50 

452 



2) INFORMATION FOR SEQ ID NO: 959 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



45 



60 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania aethiopica 
30 (B) STRAIN: ATCC 50119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 959 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC CAACTCCGAC TGCGTCATCT 50 

35 ATGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACCGT GATCAATGGT CGCGAGGAGT CGATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACTTCGAA CATGCCAGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTACACCGGC ATCACCCTGG CCGAGTACTA CCGTGATATG 2 50 

GGCAAGCATA TCGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 300 

4 0 GCTTCGTGAG ATTTCGGGTC GTCTGGCGGA GATGCCGGCC GATGGTGGCT 3 50 

ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 4 00 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATTGT 4 50 

CGGTGCCGTG TCTCCGCCG 4 69 



2) INFORMATION FOR SEQ ID NO: 960 



(i) SEQUENCE CHARACTERISTICS: 
50- (A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 3 0815 



534 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC 
ATGTCGGCTG CGGCGAGCGC GGTAATGAGA 
5 TTCCCGACCC TGACGACCGT GATCGATGGT 
GCGCACCTGC CTCGTGGCGA ACACTTCGAA 
AGGCCTCTAT TTACACCGGC ATCACCCTGG 
GGCAAGCATA TCGCCATGAT GGCCGACTCG 
GCTTCGTGAG ATTTCGGGTC GTTTGGCGGA 
10 ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT 
CTCGTCACCT GCATCGGCGG GCCGAAGCGC 
CGGTGCCGTG TCTCCGCCG 



PCT/CA00/01150 

ID NO: 960 

CAACTCCGAC TGCGTCATCT 50 
TGGCCGAGGT GCTCATGGAG 100 
CGCGAGGAGT CGATCATGAA 150 
CATGCCAGTC GCAGCCCGTG 2 00 

CCGAGTACTA CCGTGATATG 250 
ACGTCTCGCT GGGCCGAGGC 3 00 

GATGCCGGCC GATGGTGGCT 3 50 

CCTTCTACGA GCGCGCCGGC 4 00 

CAGGGCTCCG TCACGATTGT 450 

469 



15 



25 



2) INFORMATION FOR SEQ ID NO: 961 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 469 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania guyanensis 

(B) STRAIN: ATCC 50126 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 961 

TGTCATCAGT CAGGCCCTCT CCAAGTACTC CAACTCGGAC TGTGTCATGT 50 

ATGTCGGCTG CGGTGAACGC GGTAACGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACTGT GATCGATGGT CGCGAAGAGT CCATCATGAA 150 

3 5 GCGCACCTGC CTCGTGGCGA ACACCTCGAA CATGCCCGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTATACCGGC ATCACCCTTG CTGAGTACTA CCGTGATATG 2 50 

GGCAAGCACA TTGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 3 00 

GCTGCGTGAG ATTTCGGGTC GATTGGCGGA GATGCCGGCT GATGGTGGCT 350 

ACCCTGCCTA CCTCAGCGCC CGCCTCGCCT CCTTCTACGA GCGCGCCGGT 4 00 

4 0 CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATCGT 4 50 

CGGTGCAGTG TCTCCACCG 469 



45 2) INFORMATION FOR SEQ ID NO: 962 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 69 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania donovani 

(B) STRAIN: ATCC 50212 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 962 

535 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

TGTCATTAGT CAGGCCCTCT CCAAGTACTC CAACTCCGAT TGCGTCATCT 50 

ATGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

TTCCCGACCC TGACGACCGT GATCGATGGC CGCGAGGAGT CGATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACCTCGAA CATGCCAGTC GCAGCCCGTG 200 

5 AGGCCTCTAT TTACACCGGC ATCACCCTGG CCGAGTACTA CCGTGATATG 250 

GGCAAGCATA TCGCCATGAT GGCTGACTCG ACGTCTCGCT GGGCCGAGGC 300 

GCTTCGTGAG ATTTCGGGTC GTCTGGCGGA GATGCCGGCC GATGGTGGCT 350 

ACCCCGCCTA CCTCAGCGCT CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 400 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATCGT 450 

10 CGGTGCCGTG TCTCCACCG 4 69 



15 



2) INFORMATION FOR SEQ ID NO: 963 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
20. (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Leishmania hertigi 

(B) STRAIN: ATCC 50125 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 963 

30 TGTGATCAGT CAGGCCCTTT CCAAGTACTC CAACTCGGAC TGCGTCATCT 50 

ACGTCGGCTG TGGTGAGCGC GGGAACGAGA TGGCCGAGGT GCTCATGGAT 100 

TTCCCGACTT TGACGACTGT GATCGATGGT CGCGAGGAGT CGATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACCTCCAA CATGCCAGTT GCAGCCCGTG 2 00 

AGGCTTCTAT CTATACCGGC ATCACACTGG CTGAGTACTA TCGTGATATG 250 

35 GGCAAGCACA TTGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 300 

GTTGCGTGAG ATTTCCGGTC GGCTGGCGGA GATGCCGGCC GATGGTGGTT 3 50 

ACCCCGCCTA CCTCAGTGCC CGTCTCGCCT CCTTCTACGA GCGCGCTGGC 400 

CTCGTGACCT GTATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACAATTGT 450 

TGGTGCGGTG TCTCCACCG 4 69 

40 



2 ) INFORMATION FOR SEQ ID NO: 964 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania mexicana 
55 (B) STRAIN: ATCC 50156 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 964 

GGTCATTAGT CAGGCCCTTT CCAAGTACTC CAACTCTGAC TGCGTCATCT 50 
60 ACGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 100 

536 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



TTCCCGACCC TGACGACCAT 
GCGCACCTGC CTCGTGGCGA 
AGGCCTCTAT CTACACCGGC 
GGCAAGCATA TCGCCATGAT 
GCTTCGTGAG ATTTCGGGTC 
ACCCCGCCTA CCTCAGCGCT 
CTCGTCACCT GCATCGGCGG 
CGGTGCCGTG TCTCCGCCG 



GATCGATGGT 
ACACCTCGAA 
ATCACCCTCG 
GGCCGACTCG 
GTCTGGCGGA 
CGTCTCGCCT 
GCCGAAGCGC 



CGGGAGGAGT CGATCATGAA 
CATGCCCGTC GCAGCCCGTG 
CCGAGTACTA CCGTGATATG 
ACGTCTCGCT GGGCTGAGGC 
GATGCCGGCC GATGGTGGCT 
CCTTCTACGA GCGCGCCGGC 
CAGGGCTCCG TCACGATCGT 



150 
200 
250 
300 
350 
400 
450 
469 



2) INFORMATION FOR SEQ ID NO: 965 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

( D ) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania tropica 

(B) STRAIN: ATCC 50129 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 965 

TGTAATTAGT CAGGCCCTCT CCAAGTACTC CAACTCCGAC TGCGTCATCT 50 

ATGTCGGCTG CGGCGAGCGC GGTAATGAGA TGGCCGAGGT GCTCATGGAG 10 0 

TTCCCGACCC TGACGACCGT GATCGATGGT CGCGAGGAGT CGATCATGAA 150 

GCGCACCTGC CTCGTGGCGA ACACTTCGAA CATGCCAGTC GCAGCCCGTG 2 00 

AGGCCTCTAT TTACACCGGC ATCACCCTGG CCGAGTACTA CCGTGATATG 2 50 

GGCAAGCATA TCGCCATGAT GGCCGACTCG ACGTCTCGCT GGGCCGAGGC 3 00 

GCTTCGTGAG ATTTCGGGTC GTTTGGCGGA GATGCCGGCC GATGGTGGCT 350 

ACCCTGCCTA TCTCAGCGCT CGTCTCGCCT CCTTCTACGA GCGCGCCGGC 4 00 

CTCGTCACCT GCATCGGCGG GCCGAAGCGC CAGGGCTCCG TCACGATTGT 4 50 

CGGTGCCGTG TCTCCGCCG 4 69 



2) INFORMATION FOR SEQ ID NO: 966 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 

(B) STRAIN: ATCC 27337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 966 

CACCAGTTCG CCAAGTGGGC AGATGCTCAG ATAGTTGTAT ACGTTGGTTG 50 

TGGAGAACGT GGTAACGAGA TGACAGACGT TCTAAATGAA TTCCCAGAAC 100 

TGATTGACCC TCATACAGGC GAATCTCTAA TGAAGAGAAC AGTTCTTATA 150 

GCTAATACGT CAAATATGCC AGTTGCAGCC AGAGAGGCAA GTATATATAC 2 00 



537 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

AGGTATTACA ATAGCTGAAT ATTTTAGAGA 
TAATGGCCGA CTCTACATCA AGATGGGCAG 
GGTCGTCTAG AAGAAATGCC TGGTGATGAA 
TTCTAGAGCT GCAGAGTTCT ATGAAAGAGC 
5 GTAGCGATAA TAGAGAGGGA GCCCTTACAA 



PCT/CAOO/01150 

CATGGGATAT TCAGTAGCGG 250 

AGGCCCTAAG AGAGATGTCA 3 00 

GGTTATCCAG CCTATCTAGG 350 

AGGTAAGGTA ATATGTAAGG 4 00 

TAATAGGTGC CGTGTCACC 44 9 



2) INFORMATION FOR SEQ ID NO: 967 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Bordetella pertussis 

(B) STRAIN: ATCC 97 97 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 967 

25 CTGGTGGTGT CGGCCGCAGA CGGCCCGATG CCGCAGACGC GCGAGCACAT 50 

TTTGCTGTCG CGCCAGGTTG GCGTGCCGTA CATCATCGTG TTCCTGAACA 100 

AGGCGGACAT GGTTGATGAC GCGGAGCTGC TCGAGCTGGT GAAGATGGAA 150 

GTCCGCGAAC TGCTGAGCAA GTACGATTTC CCGGGCGATG ACACGCCGAT 2 00 

CGTGAAGGGT TCGGCCAAGC TGGCGCTGGA AGGCGACAAG GGCGAACTGG 250 

30 GCGAGCAGGC GATTCTGTCG CTGGCGCAAG CGCTGGACAC GTACATTCCG 3 00 

ACGCCGGAGC GCGCGGTCGA CGGTGCGTTC CTGATGCCGG TGGAAGACGT 350 

GTTCTCGATC TCGGGCCGTG GCACGGTGGT GACTGGCCGT ATCGAGCGCG 4 00 

GCGTGGTGAA GGTTGGCGAG GAAATCGAAA TCGTGGGCAT CAAGCCGACG 4 50 

GTGAAGACGA CCTGCACGGG CGTGGAGATG TTCCGCAAGC TGCTGGACCA 500 

35 GGGCCAGGCG GGCGACAACG TGGGTATCTT GCTGCGCGGC ACCAAGCGTG 550 

AAGACGTCGA GCGTGGCCAG GTGCTGGCCA AGCCGGGTTC GATCAACCCG 600 

CACACGGACT TCACGGCCGA GGTGTACATT CTGTCCAAGG AAGAGGGTGG 650 

CCGTCACACG CCGTTCTTCA ACGGCTATCG TCCGCAGTTC TACTTCCGCA 700 

CGACGGACGT GACCGGCACG ATCGACCTGC CGGCGGACAA GGAAATGGTG 750 

40 CTGCCGGGCG ACAACGTGTC GATGACCGTC AAGCTGCTGG CCCCGATCGC 8 00 

CATGGAAGAA GGTCTGCGTT TCGCCA 826 



45 2) INFORMATION FOR SEQ ID NO: 968 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

55 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella. pertussis 

(B) STRAIN: BD180 



60 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 968 

538 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CGATCCTGGT GGTGTCGGCC GCAGACGGCC 
CACATTTTGC TGTCGCGCCA GGTTGGCGTG 
GAACAAGGCG GACATGGTTG ATGACGCGGA 
TGGAAGTCCG CGAACTGCTG AGCAAGTACG 
5 CCGATCGTGA AGGGTTCGGC CAAGCTGGCG 
ACTGGGCGAG CAGGCGATTC TGTCGCTGGC 
TTCCGACGCC GGAGCGCGCG GTCGACGGTG 
GACGTGTTCT CGATCTCGGG CCGTGGCACG 
GCGCGGCGTG GTGAAGGTTG GCGAGGAAAT 

10 CGACGGTGAA GACGACCTGC ACGGGCGTGG 
GACCAGGGCC AGGCGGGCGA CAACGTGGGT 
GCGTGAAGAC GTCGAGCGTG GCCAGGTGCT 
ACCCGCACAC GGACTTCACG GCCGAGGTGT 
GGTGGCCGTC ACACGCCGTT CTTCAACGGC 

15 CCGCACGACG GACGTGACCG GCACGATCGA 
TGGTGCTGCC GGGCGACAAC GTGTCGATGA 
ATCGCCATGG AAGAAGG 



PCT/CA00/01150 

CGATGCCGCA GACGCGCGAG 50 

CCGTACATCA TCGTGTTCCT 100 

GCTGCTCGAG CTGGTGGAGA 150 

ATTTCCCGGG CGATGACACG 2 00 

CTGGAAGGCG ACAAGGGCGA 2 50 

GCAAGCGCTG GACACGTACA 3 00 

CGTTCCTGAT GCCGGTGGAA 3 50 

GTGGTGACTG GCCGTATCGA 4 00 

CGAAATCGTG GGCATCAAGC 4 50 

AGATGTTCCG CAAGCTGCTG 50 0 

ATCTTGCTGC GCGGCACCAA 550 

GGCCAAGCCG GGTTCGATCA 600 

ACATTCTGTC CAAGGAAGAG 650 

TATCGTCCGC AGTTCTACTT 70 0 

CCTGCCGGCG GACAAGGAAA 750 

CCGTCAAGCT GCTGGCCCCG 8 00 

817 



20 



30 



60 



2) INFORMATION FOR SEQ ID NO: 969 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 63 7 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus columbae 

(B) STRAIN: ATCC 51263 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 969 

CCTATGCCAC AAACTCGTGA ACACATTCTT TTATCACGTA ACGTTGGTGT 50 

GCCATACATC GTTGTTTTCT TAAACAAAGT TGATATGGTT GACGACGAAG 100 

AATTATTAGA ATTAGTTGAA ATGGAAGTTC GTGACTTATT AACTGAATAT 150 

4 0 GACTTCCCAG GAGACGATGT TCCTGTAATC GCTGGTTCTG CATTAAAAGC 2 00 

TTTAGAAGGC GACCCTGCTT ACGAAGAAAA AATCTTAGAA TTAATGGCTG 2 50 

CAGTTGACGA ATACATCCCA ACTCCAGAAC GTGACAACGA CAAACCATTC 3 00 

ATGATGCCAG TTGAAGACGT GTTCTCAATT ACTGGTCGTG GTACTGTTGC 3 50 

TACAGGTCGT GTTGAACGTG GACAAGTTCG TGTTGGTGAC GAAGTTGAAA 400 

4 5 TCGTTGGTAT CGCTGACGAA ACTTCTAAAA CAACAGTTAC TGGTGTTGAA 4 50 

ATGTTCCGTA AATTATTAGA TTACGCTGAA GCTGGAGACA ACATCGGTGC 50 0 

ATTATTACGT GGTGTGGCTC GTGAAGACAT CCAACGTGGT CAAGTATTAG 550 

CTAAACCAGG TTCAATCACT CCACATACAA AATTCACTGC TGAAGTGTAC 600 

GTTTTAACTA AAGAAGAAGG TGGACGTCAT ACTCCAT 63 7 

50 

2) INFORMATION FOR SEQ ID NO: 970 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 634 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



539 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CAOO/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
5 (B) STRAIN: ATCC 4 9997 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 970 

TATGCCTCAA ACACGTGAAC ACATCTTGTT ATCACGTAAC GTTGGTGTAC 50 

10 CATACATCGT TGTTTTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA 100 

TTACTAGAAT TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATATGA 150 

CTTCCCAGGC GACGATGTTC CTGTAATCGC TGGTTCTGCT TTGAAAGCTC 2 00 

TTGAAGGCGA TGCTTCATAC GAAGAAAAAA TCATGGAATT AATGGCTGCA 250 

GTTGACGAAT ACGTTCCAAC TCCAGAACGT GACACTGACA AACCATTCAT 3 00 

15 GATGCCAGTC GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA 350 

CAGGCCGTGT TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC 4 00 

GTTGGTATTG CTGAAGAAAC TGCTAAAACA ACTGTAACTG GTGTTGAAAT 450 

GTTCCGTAAA TTGTTAGACT ATGCTGAAGC AGGGGATAAC ATTGGTGCAT 500 

TGCTACGTGG GGTTGCTCGT GAAGACATCC AACGTGGACA AGTATTAGCT 550 

2 0 AAAGCTGGTA CAATCACACC TCATACAAAA TTTAAAGCTG AAGTTTACGT 600 

TTTAACAAAA GAAGAAGGTG GACGTCACAC TCCA 634 



25 2) INFORMATION FOR SEQ ID NO: 971 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 87 bases 

(B) TYPE: Nucleic acid 

3 0 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 971 



GGACCAATGC CACAAACTCG TGAGCACATC CTTCTTTCAC GTCAGGTTGG 50 

TGTTAAACAC CTTATCGTCT TCATGAACAA AGTTGACTTG GTTGACGACG 100 

AAGAATTGCT TGAATTGGTT GAAATGGAAA TCCGTGACCT ATTGTCAGAA 150 

TACGACTTCC CAGGTGACGA TCTTCCAGTT ATCCAAGGTT CAGCACTTAA 200 

45 AGCTCTTGAA GGTGACTCTA AATACGAAGA CATCGTTATG GAATTGATGA 250 

ACACAGTTGA TGAGTATATC CCAGAACCAG AACGTGACAC TGACAAACCA 3 00 

TTGCTTCTTC CAGTCGAGGA CGTATTCTCA ATCACTGGAC GTGGTACAGT 350 

TGCTTCAGGA CGTATCGACC GTGGTATCGT TAAAGTCAAC GACGAAATCG 400 

AAATCGTTGG TATCAAAGAA GAAACTCAAA AAGCAGTTGT TACTGGTGTT 450 

50 GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCTGGAG ATAACGTAGG 500 

TGTCCTTCTT CGTGGTGTTC AACGTGATGA AATCGAACGT GGACAAGTTA 550 

TCGCTAAACC AGGTTCAATC AACCCACACA CTAAATTCAA AGGTGAAGTC 600 

TACATCCTTA CTAAAGAAGA AGGTGGACGT CACACTCCAT TCTTCAACAA 650 

CTACCGTCCA CAATTCTACT TCCGTACTAC TGACGTTACA GGTTCAATCG 700 

55 AACTTCCAGC AGGTACTGAA ATGGTAATGC CTGGTGATAA CGTGACAATC 750 

GACGTTGAGT TGATTCACCC AATCGCCGTA GAACAAG 787 



60 2) INFORMATION FOR SEQ ID NO: 972 

540 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 35401 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 972 



TAGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA GCACATCCTG 50 

CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC TGAACAAATG 100 

CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTTGAA ATGGAAGTTC 150 

GTGAACTTCT GTCTCAGTAC GACTTCCCGG GCGACGACAC TCCGATCGTT 2 00 

2 0 CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GACGCAGAGT GGGAAGCGAA 2 50 

AATCCTGGAA CTGGCTGGCT TCCTGGATTC TTACATTCCG GAACCAGAGC 300 

GTGCGATTGA CAAGCCGTTC CTGCTGCCGA TCGAAGACGT ATTCTCCATC 350 

TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAACGCG GTATCATCAA 400 

AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGAGACT CAGAAGTCTA 450 

2 5 CCTGTACTGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA AGGCCGTGCT 500 

GGTGAGAACG TAGGTGTTCT GCTGCGTGGT ATCAAACGTG AAGAAATCGA 550 

ACGTGGTCAG GTACTGGCTA AGCCGGGCAC CATCAAGCCG CACACCAAGT 600 

TCGAATCTGA AGTGTACATT CTGTCCAAAG ATGAAGGCGG CCGTCATACT 650 

CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA CTACTGACGT 700 

3 0 GACTGGTACC ATCGAACTGC CGGAAGGCGT AGAGATGGTA ATGCCGGGCG 750 

ACAACATCAA AATGGTTGTT ACCCTGATCC ACCCGATCGC GATGGACGAC 800 

GGT 803 



35 



45 



2) INFORMATION FOR SEQ ID NO: 973 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 762 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 4 3 895 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 973 

ACTGACGGCC CGATGCCGCA GACTCGTGAG CACATCCTGC TGGGTCGTCA 50 

GGTAGGCGTT CCGTACATCA TCGTGTTCCT GAACAAATGC GACATGGTTG 100 

ATGACGAAGA GCTGCTGGAA CTGGTTGAAA TGGAAGTTCG TGAACTTCTG 150 

55 TCTCAGTACG ACTTCCCGGG CGACGACACT CCGATCGTTC GTGGTTCTGC 200 

TCTGAAAGCG CTGGAAGGCG ACGCAGAGTG GGAAGCGAAA ATCCTGGAAC 250 

TGGCTGGCTT CCTGGATTCC TACATTCCGG AACCAGAGCG TGCRATTGAC 300 

AAGCCGTTCC TGCTGCCGAT CGAAGACGTA TTCTCCATCT CCGGTCGTGG 350 

TACCGTTGTT ACCGGTCGTG TAGAACGCGG TATCATCAAA GTTGGTGAAG 4 00 

60 AAGTTGAAAT CGTTGGTATC AAAGAGACTC AGAAGTCTAC CTGTACTGGC 4 50 

541 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GTTGAAATGT TCCGCAAACT 
AGGTGTTCTG CTGCGTGGTA 
TACTGGCTAA GCCGGGCACC 
GTGTACATTC TGTCCAAAGA 
AGGCTACCGT CCGCAGTTCT 
TTGAACTGCC GGAAGGCGTA 
ATGGTTGTTA CC 



GCTGGACGAA GGCCGTGCTG 
TCAAACGTGA AGAAATCGAA 
ATCAAGCCGC ACACCAAGTT 
TGAAGGCGGC CGTCATACTC 
ACTTCCGTAC TACTGACGTG 
GAGATGGTAA TGCCGGGCGA 



PCT/CAOO/01150 

GTGAGAACGT 500 

CGTGGTCAGG 55 0 

CGAATCTGAA 600 

CGTTCTTCAA 650 

ACTGGTACCA 700 

CAACATCAAA 750 

762 



2) INFORMATION FOR SEQ ID NO: 974 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 11775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 974 



CCTGGTAGTT GCTGCGACTG 
TCCTGCTGGG TCGTCAGGTA 
AAATGCGACA TGGTTGATGA 
AGTTCGTGAA CTTCTGTCTC 
- TCGTTCGTGG TTCTGCTCTG 
GCGAAAATCC TGGAACTGGC 
AGAGCGTGCG ATTGACAAGC 
CCATCTCCGG TCGTGGTACC 
ATCAAAGTTG GTGAAGAAGT 
GTCTACCTGT ACTGGCGTTG 
GTGCTGGTGA GAACGTAGGT 
ATCGAACGTG GTCAGGTACT 
CAAGTTCGAA TCTGAAGTGT 
ATACTCCGTT CTTCAAAGGC 
GACGTGACTG GTACCATCGA 
GGGCGACAAC ATCAAAATGG 
ACGA 



ACGGCCCGAT GCCGCAGACT 
GGCGTTCCGT ACATCATCGT 
CGAAGAGCTG CTGGAACTGG 
AGTACGACTT CCCGGGCGAC 
AAAGCGCTGG AAGGCGACGC 
TGGCTTCCTG GATTCCTACA 
CGTTCCTGCT GCCGATCGAA 
GTTGTTACCG GTCGTGTAGA 
TGAAATCGTT GGTATCAAAG 
AAATGTTCCG CAAACTGCTG 
GTTCTGCTGC GCGGTATCAA 
GGCTAAGCCG GGCACCATCA 
ACATTCTGTC CAAAGATGAA 
TACCGTCCGC AGTTCTACTT 
ACTGCCGGAA GGCGTAGAGA 
TTGTTACCCT GATCCACCCG 



CGTGAGCACA 50 

GTTCCTGAAC 100 

TTGAAATGGA 150 

GACACTCCGA 2 00 

AGAGTGGGAA 2 50 

TTCCGGAACC 3 00 

GACGTATTCT 3 50 

ACGCGGTATC 4 00 

AGACTCAGAA 4 50 

GACGAAGGCC 500 

ACGTGAAGAA 550 

AGCCGCACAC 600 

GGCGGTCGTC 650 

CCGTACTACT 70 0 

TGGTAATGCC 750 

ATCGCGATGG 8 00 



804 



2) INFORMATION FOR SEQ ID NO: 975 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 



542 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 975 



PCT/CA00/01150 



GCGATCCTGG TAGTTGCTGC GACTGACGGC CCGATGCCGC AGACTCGTGA 50 

GCACATCCTG CTGGGTCGTC AGGTAGGCGT TCCGTACATC ATCGTGTTCC 100 

5 TGAACAAATG CGACATGGTT GATGACGAAG AGCTGCTGGA ACTGGTTGAA 150 

ATGGAAGTTC GTGAACTTCT GTCTCAGTAC GACTTCCCGG GCGACGACAC 2 00 

TCCGATCGTT CGTGGTTCTG CTCTGAAAGC GCTGGAAGGC GACGCAGAGT 2 50 

GGGAAGCGAA AATCCTGGAA CTGGCTGGCT TCCTGGATTC YTACATTCCG 3 00 

GAACCAGAGC GTGCGATTGA CAAGCCGTTC CTGCTGCCGA TCGAAGACGT 350 

10 ATTCTCCATC TCCGGTCGTG GTACCGTTGT TACCGGTCGT GTAGAACGCG 4 00 

GTATCATCAA AGTTGGTGAA GAAGTTGAAA TCGTTGGTAT CAAAGAGACT 450 

CAGAAGTCTA CCTGTACTGG CGTTGAAATG TTCCGCAAAC TGCTGGACGA 500 

AGGCCGTGCT GGTGAGAACG TAGGTGTTCT GCTGCGTGGT ATCAAACGTG 550 

AAGAAATCGA ACGTGGTCAG GTACTGGCTA AGCCGGGCAC CATCAAGCCG 600 

15 CACACCAAGT TCGAATCTGA AGTGTACATT CTGTCCAAAG ATGAAGGCGG 650 

TCGTCATACT CCGTTCTTCA AAGGCTACCG TCCGCAGTTC TACTTCCGTA 700 

CTACTGACGT GACTGGTACC ATCGAACTGC CGGAAGGTGT AGAGATGGTA 750 

ATGCCGGGCG ACAACATCAA AATGGTTGTT ACCCTGATCC ACCCGATCGC 800 

GATG 8 04 

20 



2) INFORMATION FOR SEQ ID NO: 976 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 
35 (B) STRAIN: Mavi-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 976 

GGCGCGATCC TGGTGGTCGC CGCCACCGAC GGCCCGATGC CGCAGACCCG 50 

4 0 TGAGCACGTG CTGCTCGCCC GTCAGGTCGG TGTGCCCTAC ATCCTGGTCG 100 

CCCTGAACAA GGCCGACATG GTCGACGACG AGGAGCTCCT CGAGCTCGTC 150 

GAGATGGAGG TCCGCGAGCT GCTGGCCGCC CAGGAGTTCG ACGAGGACGC 2 00 

CCCGGTGGTG CGGGTCTCGG CGCTCAAGGC GCTCGAGGGC GACGCCAAGT 250 

GGGTCGAGTC CGTCGAGCAG CTGATGGAGG CCGTCGACGA GTCGATCCCG 3 00 

45 GACCCGGTCC GCGAGACGGA GAAGCCGTTC CTGATGCCGG TGGAGGACGT 350 

CTTCACCATC ACCGGGCGTG GCACCGTGGT CACCGGTCGT GTCGAGCGCG 4 00 

GTGTGATCAA CGTGAACGAG GAAGTCGAGA TCGTCGGCAT CCGCCCGACC 450 

AGCACCAAGA CCACGGTCAC CGGTGTGGAR ATGTTCCGCA AGCTGCTCGA 500 

CCAGGGCCAG GCCGGTGACA ACGTCGGTCT GCTGCTGCGT GGTATCAAGC 550 

50 GTGAGGACGT CGAGCGCGGC CAGGTCGTGA CCAAGCCCGG CACCACCACG 600 

CCGCACACCG AGTTCGAGGG CCAGGTCTAC ATCCTGTCCA AGGACGAGGG 650 

CGGCCGGCAC ACGCCGTTCT TCAACAACTA CCGTCCGCAG TTCTACTTCC 700 

GCACCACCGA CGTGACCGGT GTGGTGACGC TGCCGGAGGG CACCGAGATG 750 

GTGATGCCCG GTGACAACAC CAACATCTCG GTGAAGCTGA TCCAGCCCGT 800 

55 CGCCATGGAC GACGGTSTGC GGTTC 825 



2) INFORMATION FOR SEQ ID NO: 977 

60 



543 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 977 

15 TATCCTTGTA GTAGCTTCAA CTGACGGACC AATGCCACAA ACTCGTGAGC 50 

ACATCCTTCT TTCACGTCAG GTTGGTGTTA AACACCTTAT CGTCTTCATG 100 

AACAAAGTTG ACTTGGTTGA CGACGAAGAA TTGCTTGAAT TGGTTGAAAT 150 

GGAAATCCGT GACCTATTGT CAGAATACGA CTTCCCAGGT GACGATCTTC 2 00 

CAGTTATCCA AGGTTCAGCA CTTAAAGCTC TTGAAGGTGA CTCTAAATAC 250 

2 0 GAAGACATCG TTATGGAATT GATGAACACA GTTGATGAGT ATATCCCAGA 3 00 

ACCAGAACGT GACACTGACA AACCATTGCT TCTTCCAGTC GAGGACGTAT 3 50 

TCTCAATCAC TGGACGTGGT ACAGTTGCTT CAGGACGTAT CGACCGTGGT 4 00 

ATCGTTAAAG TCAACGACGA AATCGAAATT GTTGGTATCA AAGAAGAAAC 450 

TCAAAAAGCA GTTGTTACTG GTGTTGAAAT GTTCCGTAAA CAACTTGACG 500 

2 5 AAGGTCTTGC TGGAGATAAC GTAGGTGTCC TTCTTCGTGG TGTTCAACGT 550 

GATGAAATCG AACGTGGACA AGTTATCGCT AAACCAGGTT CAATCAACCC 600 

ACACACTAAA TTCAAAGGTG AAGTCTACAT CCTTACTAAA GAAGAAGGTG 650 

GACGTCACAC TCCATTCTTC AACAACTACC GTCCACAATT CTACTTCCGT 700 

ACTACTGACG TTACAGGTTC AATCGAACTT CCAGCAGGTA CTGAAATGGT 750 

3 0 AATGCCTGGT GATAACGTGA CAATCGACGT TGAGTTGATT CACCCAATCG 800 

CCGTAGAACA AGGTACTACA 82 0 



35 2) INFORMATION FOR SEQ ID NO: 978 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

4 0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: M-Gor-1 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 978 



GGCGCGATCC TGGTGGTCGC CGCCACCGAT GGCCCGATGC CGCAGACCCG 50 

TGAGCACGTG CTGCTCGCCC GTCAGGTGGG CGTGCCCTAC ATCCTGGTGG 100 

CGCTGAACAA GTCCGACGCG GTCGACGACG AGGAGCTGCT CGAGCTCGTC 150 

GAGCTGGAGG TCCGCGAGTT GCTGGCCGCC CAGGACTTCG ACGAGGAAGC 2 00 

55 TCCGGTGGTC CGGGTCTCGG CGCTGAAGGC GCTCGAGGGC GACGCCACCT 2 50 

GGGTGAAGTC GGTAGAGGAC TTGATGGACG CGGTCGACGA GTCGATTCCG 3 00 

GACCCGGTCC GCGACACCGA CAAGCCGTTC CTGATGCCCG TCGAGGACGT 3 50 

CTTCACCATC ACCGGTCGTG GCACCGTCGT CACCGGCCGT GTGGAGCGCG 4 00 

GCGTGGTGAA CGTGAACGAG GAAGTCGAGA TCGTCGGCAT CAAGCCGACC 450 

60- AGCACCAAGA CCACGGTCAC CGGTGTGGAG ATGTTCCGCA AGCTGCTCGA 500 

544 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CCAGGGTCAG 
GTGAGGACGT 
CCGCACACCG 
CGGCCGGCAC 
GCACCACCGA 
GTGATGCCCG 
CGCCATGGAC 



PCT/CA00/01150 



GCCGGTGACA 
CGAGCGCGGC 
AGTTCGAGGG 
ACGCCGTTCT 
CGTGACCGGT 
GTGACAACAC 
GACGGTCTGC 



ACGTCGGTCT 
CAGGTCGTCA 
TCAGGTCTAC 
TCAACAACTA 
GTGGTGACGC 
CAACATCTCG 
GG 



GCTGCTGCGT 
TCAAGCCCGG 
ATCCTGTCCA 
CCGTCCGCAG 
TGCCGGAGGG 
GTGAAGCTGA 



GGTGTCAAGC 
CACCACCACT 
AGGACGAGGG 
TTCTACTTCC 
CACCGAAATG 
TCCAGCCCGT 



550 
600 
650 
700 
750 
800 
822 



10 



15 



20 
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2) INFORMATION FOR SEQ ID NO: 979 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 979 



CTATCCTTGT 
CACATCCTTC 
GAACAAAGTT 
TGGAAATCCG 
CCAGTTATCC 
CGAAGACATC 
AACCAGAACG 
TTCTCAATCA 
TATCGTTAAA 
CTCAAAAAGC 
GAAGGTCTTG 
TGATGAAATC 
CACACACTAA 
GGACGTCACA 
TACTACTGAC 
TAATGCCTGG 
GCCGTAGAAC 



AGTAGCTTCA 
TTTCACGTCA 
GACTTGGTTG 
TGACCTATTG 
AAGGTTCAGC 
GTTATGGAAT 
TGACACTGAC 
CTGGACGTGG 
GTCAACGACG 
AGTTGTTACT 
CTGGAGATAA 
GAACGTGGAC 
ATTCAAAGGT 
CTCCATTCTT 
GTTACAGGTT 
TGATAACGTG 
AAGGTACTAC 



ACTGACGGAC 
GGTTGGTGTT 
ACGACGAAGA 
TCAGAATACG 
ACTTAAAGCT 
TGATGAACAC 
AAACCATTGC 
TACAGTTGCT 
AAATCGAAAT 
GGTGTTGAAA 
CGTAGGTGTC 
AAGTTATCGC 
GAAGTCTACA 
CAACAACTAC 
CAATCGAACT 
ACAATCGACG 
A 



CAATGCCACA 
AAACACCTTA 
ATTGCTTGAA 
ACTTCCCAGG 
CTTGAAGGTG 
AGTTGATGAG 
TTCTTCCAGT 
TCAGGACGTA 
CGTTGGTATC 
TGTTCCGTAA 
CTTCTTCGTG 
TAAACCAGGT 
TCCTTACTAA 
CGTCCACAAT 
TCCAGCAGGT 
TTGAGTTGAT 



AACTCGTGAG 50 

TCGTCTTCAT 100 

TTGGTTGAAA 150 

TGACGATCTT 2 00 

ACTCTAAATA 2 50 

TATATCCCAG 3 00 

CGAGGACGTA 3 50 

TCGACCGTGG 400 

AAAGAAGAAA 450 

ACAACTTGAC 50 0 

GTGTTCAACG 550 

TCAATCAACC 60 0 

AGAAGAAGGT 650 

TCTACTTCCG 70 0 

ACTGAAATGG 750 

TCACCCAATC 80 0 

821 
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2) INFORMATION FOR SEQ ID NO: 980 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: ATCC 25177 



545 
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WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 980 



PCT/CA00/01150 



GGTGCGATCC TGGTGGTCGC CGCCACCGAC GGCCCGATGC CCCAGACCCG 50 

CGAGCACGTT CTGCTGGCGC GTCAAGTGGG TGTGCCCTAC ATCCTGGTAG 100 

5 CGCTGAACAA GGCCGACGCA GTGGACGACG AGGAGCTGCT CGAACTCGTC 150 

GAGATGGAGG TCCGCGAGCT GCTGGCTGCC CAGGAATTCG ACGAGGACGC 2 00 

CCCGGTTGTG CGGGTCTCGG CGCTCAAGGC GCTCGAGGGT GACGCGAAGT 2 50 

GGGTTGCCTC TGTCGAGGAA CTGATGAACG CGGTCGACGA GTCGATTCCG 3 00 

GACCCGGTCC GCGAGACCGA CAAGCCGTTC CTGATGCCGG TCGAGGACGT 3 50 

10 CTTCACCATT ACCGGCCGCG GAACCGTGGT CACCGGACGT GTGGAGCGCG 4 00 

GCGTGATCAA CGTGAACGAG GAAGTTGAGA TCGTCGGCAT TCGCCCATCG 4 50 

ACCACCAAGA CCACCGTCAC CGGTGTGGAG ATGTTCCGCA AGCTGCTCGA 5 00 

CCAGGGCCAG GCGGGCGACA ACGTTGGTTT GCTGCTGCGG GGCGTCAAGC 550 

GCGAGGACGT CGAGCGTGGC CAGGTTGTCA CCAAGCCCGG CACCACCACG 600 

15 CCGCACACCG AGTTCGAAGG CCAGGTCTAC ATCCTGTCCA AGGACGAGGG 650 

CGGCCGGCAC ACGCCGTTCT TCAACAACTA CCGTCCGCAG TTCTACTTCC 700 

GCACCACCGA CGTGACCGGT GTGGTGACAC TGCCGGAGGG CACCGAGATG 7 50 

GTGATGCCCG GTGACAACAC CAACATCTCG GTGAAGTTGA TCCAGCCCGT 8 00 

CGCCATGGAC GAAGGTNTGC GTTTCGCG 82 8 

20 



2) INFORMATION FOR SEQ ID NO: 981 



25 



30 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus warnerl 

(B) STRAIN: CSG 144 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 981 



40 



45 



50 



55 



CGGCGGTATC 
GTGAACACAT 
TTCTTAAACA 
TGAAATGGAA 
ATGTACCAGT 
GAATACGAAC 
CCCAACTCCA 
ACGTATTCTC 
CGTGGTCAAA 
AGAAAGCAAG 
TAGACTACGC 
GCACGTGAAG 
TACTCCACAC 
AAGGTGGACG 
TTCCGTACTA 
AATGGTTATG 
CAATCGCGAT 



TTAGTAGTAT 
CTTATTATCA 
AAGCTGACAT 
GTTCGTGACT 
TATCGTTGGT 
AAAAAATCTT 
GAACGTGACT 
AATCACTGGT 
TCAAAGTCGG 
AAAACAACAG 
TGAAGCTGGT 
ACGTACAACG 
ACAAAATTCA 
TCATACTCCA 
CTGACGTAAC 
CCTGGCGATA 
TGAAGACGG 



CTGCTGCAGA 
CGTAACGTTG 
GGTTGACGAC 
TATTATCTGA 
TCTGCATTAA 
AGACTTAATG 
CTGATAAACC 
CGTGGTACTG 
TGAAGAAGTT 
TTACAGGTGT 
GACAACATCG 
TGGACAAGTA 
AAGCTGATGT 
TTCTTCACTA 
TGGCGTTGTT 
ACGTAGAAAT 



TGGCCCAATG 
GTGTACCAGC 
GAAGAATTAT 
ATACGACTTC 
AAGCTTTAGA 
CAAGCTGTAG 
ATTCATGATG 
TAGCAACAGG 
GAAATCATCG 
AGAAATGTTC 
GTGCTTTATT 
TTAGCAGCTC 
TTACGTTTTA 
ACTACCGCCC 
CACTTACCAG 
GACTGTTGAA 



CCACAAACTC 50 

TTTAGTTGTA 100 

TAGAATTAGT 150 

CCTGGTGACG 200 

AGGCGACCCA 2 50 

ATGACT ACAT 3 00 

CCAGTTGAGG 3 50 

CCGTGTTGAA 4 00 

GTATCACTGA 4 50 

CGTAAATTAT 500 

ACGTGGTGTT 550 

CTGGCTCTAT 600 

TCTAAAGAAG 650 

ACAATTCTAC 700 

AAGGTACTGA 75 0 

TTAATCGCTC 8 00 

819 



2) INFORMATION FOR SEQ ID NO: 982 

60 



546 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 982 

15 CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAGATC GACTTGGTTG ATGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTCTTG TCAGAATACG ACTTCCCAGG TGACGATCTT 2 00 

CCAGTTATCC AAGGTTCAGC TCTTAAAGCT CTTGAAGGTG ATACTAAGTA 250 

2 0 CGAAGACATC ATCATGGAAT TGATGAACAC TGTTGATGAG TACATCCCAG 300 

AAC CAGAACG TGATACTGAC AAACCTCTTC TTCTTCCAGT CGAAGACGTA 350 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 4 00 

TACTGTTCGT GTCAACGATG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 4 50 

TCCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAGCTTGAC 500 

2 5 GAAGGTCTTG CAGGGGACAA CGTAGGTGTA CTTCTTCGTG GTATCCAACG 550. 

TGATGAAATC GAACGTGGTC AAGTTATCGC TAAACCAGGT TCAATCAACC 600 

CACACACTAA ATTCAAGGGT GAAGTTTACA TCCTTACTAA AGAAGAAGGT 650 

GGACGTCACA CTCCATTCTT CAACAACTAC CGTCCACAGT TCTACTTCCG 700 

TACAACTGAC GTTACAGGTT CAATCGAACT TCCAGCAGGT ACTGAAATGG 750 

30 TAATGCCTGG TGATAACGTA ACTATCGACG TTGAGTTGAT CCACCCAATC 800 

GCCGTTGAAC AAGG 814 



35 2) INFORMATION FOR SEQ ID NO: 983 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 983 



CTATCCTTGT AGTAGCTTCA ACTGACGGAC CAATGCCACA AACTCGTGAG 50 

CACATCCTTC TTTCACGTCA GGTTGGTGTT AAACACCTTA TCGTCTTCAT 100 

GAACAAAGTT GACTTGGTTG ACGACGAAGA ATTGCTTGAA TTGGTTGAAA 150 

TGGAAATCCG TGACCTATTG TCAGAATACG ACTTCCCAGG TGACGATCTT 2 00 

55 CCAGTTATCC AAGGTTCAGC TCTTAAAGCC CTTGAAGGTG ACACTAAATA 250 

CGAAGACATC GTTATGGAAT TGATGAACAC AGTTGATGAG TACATCCCAG 3 00 

AACCAGAACG TGACACTGAC AAACCATTGC TTCTTCCAGT CGAAGACGTA 3 50 

TTCTCAATCA CTGGTCGTGG TACAGTTGCT TCAGGACGTA TCGACCGTGG 4 00 

TATCGTTAAA GTCAACGACG AAATCGAAAT CGTTGGTATC AAAGAAGAAA 450 

60 CTCAAAAAGC AGTTGTTACT GGTGTTGAAA TGTTCCGTAA ACAACTTGAC 500 

547 
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GAAGGTCTTG CCGGAGATAA TGTAGGTGTC 
TGATGAAATC GAACGTGGAC AAGTTATTGC 
CACACACTAA ATTCAAAGGT GAAGTTTACA 
GGACGTCACA CTCCATTCTT CAACAACTAC 
TACTACTGAC GTTACAGGTT CAATCGAACT 
TAATGCCTGG TGATAACGTG ACAATCGACG 
GCCGTAGAAC 



PCT/CA00/01150 

CTTCTTCGTG GTGTTCAACG 550 

TAAACCAGGT TCAATCAACC 600 

TCCTTACTAA AGAAGAAGGT 650 

CGTCCACAAT TCTACTTCCG 700 

TCCAGCAGGT ACTGAAATGG 750 

TTGAGTTGAT CCACCCAATC 800 

810 



2) INFORMATION FOR SEQ ID NO: 984 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 984 

TCCTTGTAGT AGCTTCAACT GACGGACCAA TGCCACAAAC TCGTGAGCAC 50 

ATCCTTCTTT CACGTCAGGT TGGTGTTAAA CACCTTATCG TCTTCATGAA 100 

CAAGATCGAC TTGGTTGATG ACGAAGAATT GCTTGAATTG GTTGAAATGG 150 

AAATCCGTGA CCTCTTGTCA GAATACGACT TCCCAGGTGA CGATCTTCCA 2 00 

GTTATCCAAG GTTCAGCTCT TAAAGCTCTT GAAGGTGATA CTAAGTACGA 250 

AGACATCATC ATGGAATTGA TGAACACTGT TGATGAGTAC ATCCCAGAAC 3 00 

CAGAACGTGA TACTGACAAA CCTCTTCTTC TTCCAGTCGA AGACGTATTC 3 50 

TCAATCACTG GTCGTGGTAC AGTTGCTTCA GGACGTATCG ACCGTGGTAC 4 00 

TGTTCGTGTC AACGATGAAA TCGAAATCGT TGGTATCAAA GAAGAAATCC 4 50 

AAAAAGCAGT TGTTACTGGT GTTGAAATGT TCCGTAAACA GCTTGACGAA 500 

GGTCTTGCAG GGGACAACGT AGGTGTACTT CTTCGTGGTA TCCAACGTGA 550 

TGAAATCGAA CGTGGTCAAG TTATCGCTAA ACCAGGTTCA ATCAACCCAC 600 

ACACTAAATT CAAGGGTGAA GTTTACATCC TTACTAAAGA AGAAGGTGGA 650 

CGTCACACTC CATTCTTCAA CAACTACCGT CCACAGTTCT ACTTCCGTAC 700 

AACTGACGTT ACAGGTTCAA TCGAACTTCC AGCAGGTACT GAAATGGTAA 750 

TGCCTGGTGA TAACGTAACT ATCGACGTTG AGTTGATCCA CCCAATCGCC 8 00 

GTTGAACAAG GTACTAC 817 



2) INFORMATION FOR SEQ ID NO: 985 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 98 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 



548 
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(xi) SEQUENCE DESCRIPTION; 



SEQ ID NO: 98 5 



GACGGACCAA 
TGGTGTTAAA 
5 ACGAAGAATT 
GAATACGACT 
TAAAGCTCTT 
TGAACACTGT 
CCATTGCTTC 

10 AGTTGCTTCA 
TCGAAATCGT 
GTTGAAATGT 
AGGTGTCCTT 
TTATCGCTAA 

15 GTCTACATCC 
CAACTACCGT 
TCGAACTTCC 
ATCGACGTTG 



TGCCACAAAC 
CACCTTATTG 
GCTTGAATTG 
TCCCAGGTGA 
GAAGGTGACT 
TGATGAGTAC 
TTCCAGTCGA 
GGACGTATCG 
TGGTATCAAA 
TCCGTAAACA 
CTTCGTGGTG 
ACCAGGTTCA 
TTACTAAAGA 
CCACAATTCT 
TGCAGGTACT 
AGTTGATCCA 



TCGTGAGCAC 
TCTTCATGAA 
GTTGAAATGG 
CGATCTTCCA 
CTAAATACGA 
ATCCCAGAAC 
AGACGTATTC 
ACCGTGGTAC 
GAAGAAACTC 
ACTTGACGAA 
TTCAACGTGA 
ATCAACCCAC 
AGAAGGTGGA 
ACTTCCGTAC 
GAAATGGTAA 
CCCAATCGCC 



ATCCTTCTTT 
CAAAATTGAC 
AAATCCGTGA 
GTTATCCAAG 
AGACATCATT 
CAGAACGTGA 
TCAATCACTG 
TGTTCGTGTC 
AAAAAGCAGT 
GGTCTTGCCG 
CGAAATCGAA 
ACACTAAATT 
CGTCACACTC 
TACTGACGTT 
TGCCTGGTGA 
GTAGAACAAG 



PCT/CA00/01150 



CACGTCAGGT 50 

TTGGTAGACG 100 

CCTCTTGTCA 150 

GTTCAGCTCT 200 

ATGGAATTGA 250 

CACTGAAAAA 3 00 

GACGTGGTAC 350 

AACGACGAAA 4 00 

TGTTACTGGT 4 50 

GAGATAACGT 500 

CGTGGACAAG 550 

TAAAGGTGAA 60 0 

CATTCTTCAA 650 

ACAGGTTCAA 700 

TAACGTGACT 750 

GTACTACA 798 
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2) INFORMATION FOR SEQ ID NO: 986 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 986 



TTGTAGTAGC 
CTTCTTTCAC 
AGTTGACTTG 
TCCGTGACCT 
ATCCAAGGTT 
CATCGTTATG 
AACGTGACAC 
ATCACTGGAC 
TAAAGTCAAC 
AAGCAGTTGT 
CTTGCTGGAG 
AATCGAACGT 
CTAAATTCAA 
CACACTCCAT 
TGACGTTACA 
CTGGTGATAA 
GAACAAGGTA 



TTCAACTGAC 
GTCAGGTTGG 
GTTGACGACG 
ATTGTCAGAA 
CAGCACTTAA 
GAATTGATGA 
TGACAAACCA 
GTGGTACAGT 
GACGAAATCG 
TACTGGTGTT 
ATAACGTAGG 
GGACAAGTTA 
AGGTGAAGTC 
TCTTCAACAA 
GGTTCAATCG 
CGTGACAATC 
CTACA 



GGACCAATGC 
TGTTAAACAC 
AAGAATTGCT 
TACGACTTCC 
AGCTCTTGAA 
ACACAGTTGA 
TTGCTTCTTC 
TGCTTCAGGA 
AAATCGTTGG 
GAAATGTTCC 
TGTCCTTCTT 
TCGCTAAACC 
TACATCCTTA 
CTACCGTCCA 
AACTTCCAGC 
GACGTTGAGT 



CACAAACTCG 
CTTATCGTCT 
TGAATTGGTT 
CAGGTGACGA 
GGTGACTCTA 
TGAGTATATT 
CAGTCGAGGA 
CGTATCGACC 
TATCAAAGAA 
GTAAACAACT 
CGTGGTGTTC 
AGGTTCAATC 
CTAAAGAAGA 
CAATTCTACT 
AGGTACTGAA 
TGATTCACCC 



TGAGCACATC 50 

TCATGAACAA 100 

GAAATGGAAA 150 

TCTTCCAGTT 2 00 

AATACGAAGA 2 50 

CCAGAACCAG 3 00 

CGTATTCTCA 3 50 

GTGGTATCGT 4 00 

GAAACTCAAA 4 50 

TGACGAAGGT 500 

AACGTGATGA 550 

AACCCACACA 600 

AGGTGGACGT 650 

TCCGTACTAC 700 

ATGGTAATGC 750 

AATCGCCGTA 800 

815 



60 



2) INFORMATION FOR SEQ ID NO: 987 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
<D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus hirae 
10 (B) STRAIN: ATCC 8043 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 987 

CGGACAATCT TGGTTGTTTC TGCTACGGAT GGCCCAATGC CTCAAACTCG 50 

15 TGAACACATT TTACTTTCTC GCCAAGTAGG CGTGAAATAT TTGATTGTTT 100 

TCTTGAACAA AACAGATTTA GTCGATGATG AAGAATTAAT TGATCTAGTA 150 

GAAATGGAAG TTCGTGAACT ATTAAGCGAA TATGGATTTC CAGGTGATGA 200 

TACACCTGTT ATCAAAGGAT CAGCATTAAA AGCATTACAA GGTGATCCTG 250 

ATGCAGAAGC AGCTATCATG GAATTGATGG ACACAGTCGA TGAATACATC 3 00 

2 0 CCAACACCAG AACGTGATAC GGACAAACCA TTATTGTTAC CCGCCGAAGA 350 

TGTATTTTCA ATTACTGGAC GTGGAACAGT AGCATCTGGT CGGATTGACC 400 

GTGGGGCTGT TAGAGTCGGT GACGAAATCG AAATCGTAGG GATCAAACCA 450 

GAAACGCAAA GAGCCGTAGT AACAGGAGTT GAAATGTTCC GCAAAACGCT 500 

TGATTACGGT GAAGCAGGGG ATAACGTAGG TGTGTTATTA CGTGGGATTC 550 

2 5 AAAGAGAAGA CATCGAACGT GGCCAAGTGA TTGCCAAACC TGGTTCAATT 600 

ACACCTCATA CTAAATTCAA AGCAGAAGTT TAGGTTTTGA CTAAAGAAGA 650 

AGGCGGACGT CATACACCAT TCTTCAATAA TTATCGACCA CAATTTTATT 700 

TCCGCACAAC TGACGTAACA GGAACAATTG TTTTGCCAGA AGGAACGGAA 750 

ATGGTCATGC CTGGCGACAA CGTAACGATC GATGTAGAAT TGATTCATCC 800 

3 0 TGTTGCTATT GAAAACGGGA CGACTTTCTC CA 832 



2) INFORMATION FOR SEQ ID NO: 988 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Enterococcus mundtii 

(B) STRAIN: ATCC 43186 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 988 

50 TGGTGCGATC TTAGTTGTTT CTGCGACAGA TGGACCAATG CCTCAAACTC 50 

GCGAGCATAT CCTTTTATCA CGTCAAGTTG GTGTAAAATA TTTGATTGTA 100 

TTTTTGAATA AAGTTGATTT GGTCGATGAT GAAGAATTGA TCGATCTTGT 150 

AGAAATGGAA GTTCGTGAAT TACTGAATGA ATATGGTTTC CCAGGTGATG 2 00 

ACACACCTGT CATCAAAGGC TCTGCATTGA AAGCATTGCA AGGTGACCCA 2 50 

55 GAAGCAGAAG CTGCGATCAA TGAATTGATG GAAACAGTGG ATGACTATAT 3 00 

CCCAACACCA GAACGTGATA CCGACAAACC ATTGCTTTTA CCAGTTGAAG 3 50 

ATGTTTTCTC AATCACTGGT CGCGGAACGG TAGCATCTGG TCGTATCGAC 4 00 

CGTGGAGCCG TTCGTGTCGG TGATGAAATC GAGATCATCG GAATCAAACC 450 

TGAAACGAAA AAAGCGGTTG TGACAGGGGT AGAGATGTTC CGTAAAACGT 500 

60 TAGATTATGG CGAAGCTGGA GATAACGTAG GAATCTTGTT ACGTGGTATC 550 

550 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CAAAGAGAAG ATATTGAACG TGGACAAGTA 
CACACCACAT ACAAAATTCA AAGCGGAAGT 
AAGGCGGACG TCATACACCA TTCTTCAATA 
TTCCGCACAA CAGATGTAAC AGGTACGATC 
AATGGTCATG CCTGGAGACA ACGTAACCAT 
CAGTGGCAAT CGAACAAGGA ACGACTTTCT 



ATTGCGAAAC CTGGTTCAAT 
TTATGTATTG ACGAAAGAAG 
ACTACCGCCC ACAATTTTAT 
GTGTTGCCAG AAGGAACAGA 
CGAGGTAGAG TTGATCCATC 
CTATT 



600 
650 
700 
750 
800 
835 



2) INFORMATION FOR SEQ ID NO: 989 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 49427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98 9 

TGCGATCTTA GTAGTATC T A CTACTGATGG ACCAATGCCT CAAACACGTG 50 

AGCACATTTT GTTATCACGC CAAGTAGGTG TTAAATACTT GATCGTCTTC 100 

TTGAACAAAG TTGATTTAGT CGACGATGAA GAATTGATTG ATTTAGTTGA 150 

AATGGAAGTA CGTGAGTTAC TTTCAGAATA TGGTTTCCCA GGCGATGATA 2 00 

TTCCTGTTCT TAAAGGTTCA GCTCTGAAAG CTTTAGAAGG CGATCCTGAA 250 

CAAGAACAAG TAATCATGGA CCTAATGGAT ACGGTTGACG AATATATCCC 3 00 

AACACCAGAA CGTGACACTG ACAAACCATT CTTGTTACCA GTGGAAGATG 3 50 

TTTTCTCAAT CACAGGACGT GGGACTGTTG CATCTGGTCG TATTGATCGT 400 

GGGGAAGTTA AAGTCGGTGA CGAAGTTGAA ATTATCGGGA TCAAACCTGA 4 50 

AGTTCAAAAG GCTGTCGTAA CTGGACTTGA AATGTTCCGT AAAACATTGG 500 

ATTATGGTGA AGCTGGAGAT AACGTTGGGG TTCTATTACG TGGTATTACA 550 

CGTGATGAAA TCGAACGTGG TCAAGTATTA GCTAAACCAG GTTCAATTAC 600 

ACCACATACG AAATTCAGTG CAGAAGTTTA TGTGTTGACG AAAGAAGAAG 650 

GTGGACGTCA TACACCATTC TTTAACAACT ATCGTCCTCA ATTCTACTTC 700 

CGTACAACAG ACGTTACCGG TAATATCGTG TTGCCAGAAG GTACTGAAAT 750 

GGTCATGCCT GGCGATAACG TAACAATCGA CGTTGAATTA ATCCATCCAA 800 

TCGCCGTAGA AAAAGGAACA ACTTTCTCTA TT 832 



2) INFORMATION FOR SEQ ID NO: 990 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus emthracis 

(B) STRAIN: CIP 9440 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 990 
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GGATCCTGTA TATGCACAAA AACTAGGTGT 
TATCACAACC TGATACAGGG GAGCAAGGTT 
GTACGAAGTG GTGCGGTTGA TATTATCGTA 
5 TGTA 



TAACATCGAT GAATTACTAT 50 
TAGAAATCGC AGAAGCACTT 100 
ATTGACTCTG TAGCAGCTCT 150 

154 



10 



15 



2) INFORMATION FOR SEQ ID NO: 991 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 84 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
2 0 (A) ORGANISM: Prevotella melaninogenica 

(B) STRAIN: ATCC 25845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 991 



25 GCCATTGCAG AGGCACAGAA 
TGAGCACGCC TTCGACCGTT 
ATAACCTTTG GGTTTCACAG 
GCCGACCAGC TGATTCGCTC 
AGTTGCAGCC TTGACTCCAA 

3 0 CTGCAGTAGG TTTACAAGCA 
ACCTCAACAA TCGCAAAAAC 
GCGTGAGAAG ATTGGTGTGA 



GCAGGGCGGT 
TCTATGCAGA 
CCAGACAATG 
TTCCGCTATT 
AGAAGGAGAT 
CGACTGATGA 
TAATACTTGC 
TGTTTGGTAA 



ATTGCAGCCT 
GAAGTTAGGT 
GTGAGCAGGC 
GACATTCTCG 
TGAGGGTGAC 
GTCAGGCATT 
TGCATCTTCA 
TCCA 



TCATTGATGC 
GTGGATGTTG 
TTTAGAGATT 
TTGTCGACTC 
ATGGGTGACT 
GCGTAAACTT 
TCAACCAGTT 



50 
100 
150 
200 
250 
300 
350 
384 



35 



45 



2) INFORMATION FOR SEQ ID NO: 992 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 624 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R760 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 992 

ACACGTGAAC ACATCTTGTT ATCACGTAAC GTTGGTGTAC CATACATCGT 50 

TGTTTTCTTA AACAAAATGG ATATGGTTGA TGACGAAGAA TTACTAGAAT 100 

TAGTTGAAAT GGAAGTTCGT GACTTATTGT CAGAATATGA CTTCCCAGGC 150 

55 GACGATGTTC CTGTAATCGC TGGTTCTGCT TTGAAAGCTC TTGAAGGCGA 2 00 

TGCTTCATAC GAAGAAAAAA TCATGGAATT AATGGCTGCA GTTGACGAAT 2 50 

ACGTTCCAAC TCCAGAACGT GACACTGACA AACCATTCAT GATGCCAGTC 3 00 

GAAGACGTAT TCTCAATCAC TGGACGTGGT ACTGTTGCTA CAGGCCGTGT 3 50 

TGAACGTGGA CAAGTTCGCG TTGGTGACGA AGTTGAAATC GTTGGTATTG 400 

60 CTGAAGAAAC TGCTAAAACA ACTGTAACTG GTGTTGAAAT GTTCCGTAAA 450 
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TTGTTAGACT ATGCTGAAGC AGGGGATAAC ATTGGTGCAT TGCTACGTGG 50 0 

TGTTGCTCGT GAAGACATCC AACGTGGACA AGTATTGGCT AAAGCTGGTA 550 

CAATCACACC TCATACAAAA TTTAAAGCTG AAGTTTACGT TTTAACAAAA 60 0 

GAAGAAGGTG GACGTCATAC ACCA 624 



2) INFORMATION FOR SEQ ID NO: 993 



10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 756 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 
20 (C) ACCESSION NUMBER: extracted from U40453 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 993 

ATGGAAAACA ATAAAAAAGT ATTGAAGAAA ATGGTATTTT TTGTTTTAGT 50 

25 GACATTTCTT GGACTAACAA TCTCGCAAGA GGTATTTGCT CAACAAGACC 10 0 

CCGATCCAAG CCAACTTCAC AGATCTAGTT TAGTTAAAAA CCTTCAAAAT 150 

ATATATTTTC TTTATGAGGG TGACCCTGTT ACTCACGAGA ATGTGAAATC 2 00 

TGTTGATCAA CTTTTATCTC ACGATTTAAT ATATAATGTT TCAGGGCCAA 2 50 

ATTATGATAA ATTAAAAACT GAACTTAAGA ACCAAGAGAT GGCAACTTTA 3 00 

3 0 TTTAAGGATA AAAACGTTGA TATTTATGGT GTAGAATATT ACCATCTCTG 3 50 

TTATTTATGT GAAAATGCAG AAAGGAGTGC ATGTATCTAC GGAGGGGTAA 4 00 

CAAATCATGA AGGGAATCAT TTAGAAATTC CTAAAAAGAT AGTCGTTAAA 4 50 

GTATCAATCG ATGGTATCCA AAGCCTATCA TTTGATATTG AAACAAATAA 500 

AAAAATGGTA ACTGCTCAAG AATTAGACTA TAAAGTTAGA AAATATCTTA 550 

35 CAGATAATAA GCAACTATAT ACTAATGGAC CTTCTAAATA TGAAACTGGA 600 

TATATAAAGT TCATACCTAA GAATAAAGAA AGTTTTTGGT TTGATTTTTT 650 

CCCTGAACCA GAATTTACTC AATCTAAATA TCTTATGATA TATAAAGATA 700 

ATGAAACGCT TGACTCAAAC ACAAGCCAAA TTGAAGTCTA CCTAACAACC 750 

AAGTAA 756 

40 

2) INFORMATION FOR SEQ ID NO: 994 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 994 
55 TGGACTAACA ATCTCGCAAG AGO 23 



2) INFORMATION FOR SEQ ID NO: 995 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 995 
ACATTCTCGT GAGTAACAGG GT 



2) INFORMATION FOR SEQ ID NO: 996 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 996 
ACAAATCATG AAGGGAATCA TTTAG 



2) INFORMATION FOR SEQ ID NO: 997 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 997 
CTAATTCTTG AGCAGTTACC ATT 



2) INFORMATION FOR SEQ ID NO: 998 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 998 
GGAGGGGTAA CAAATCATGA AGG 
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10 



20 



2) INFORMATION FOR SEQ ID NO: 999 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 3 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 999 
15 TTGACCTTGT TGATGACGAA GAG 23 



2) INFORMATION FOR SEQ ID NO: 1000 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 100 0 

30 

TTAGTGTGTG GGTTGATTGA ACT 23 



3 5 2 ) INFORMATION FOR SEQ ID NO : 1001 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1001 

AAGAGTTGCT TGAATTAGTT GAG 23 



50 

2) INFORMATION FOR SEQ ID NO: 1002 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 894 bases 
55, (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

60 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Streptococcus pyogenes 

(B) STRAIN: ATCC 7002 94 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1002 

AACATGATCA CTGGTGCCGC TCAAATGGAC GGAGCTATCC TTGTAGTTGC 50 

TTCAACTGAT GGACCAATGC CACAAACTCG TGAGCACATC CTTCTTTCAC 100 

GTCAGGTTGG TGTTAAACAC CTTATCGTGT TCATGAACAA AGTTGACCTT 150 

GTTGATGACG AAGAGTTGCT TGAATTAGTT GAGATGGAAA TTCGTGACCT 2 00 

TCTTTCAGAA TACGATTTCC CAGGTGATGA CCTTCCAGTT ATCCAAGGTT 250 

CAGCTCTTAA AGCTCTTGAA GGCGACACTA AATTTGAAGA CATCATCATG 3 00 

GAATTGATGG ATACTGTTGA TTCATACATT CCAGAACCAG AACGCGACAC 350 

TGACAAACCA TTGCTTCTTC CAGTCGAAGA CGTATTCTCA ATTACAGGTC 400 

GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTACTGT TCGTGTCAAC 450 

GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTAAAA AAGCTGTTGT 500 

TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCAGGAG 550 

ACAACGTAGG TATCCTTCTT CGTGGTGTTC AACGTGACGA AATCGAACGT 60 0 

GGTCAAGTTA TTGCTAAACC AAGTTCAATC AACCCACACA CTAAATTCAA 650 

AGGTGAAGTA TATATCCTTT CTAAAGACGA AGGTGGACGT CACACTCCAT 700 

TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACAAC TGACGTAACA 750 

GGTTCAATCG AACTTCCAGC AGGTACAGAA ATGGTTATGC CTGGTGATAA 8 00 

CGTGACAATC AACGTTGAGT TGATCCACCC AATCGCCGTA GAACAAGGTA 850 

CTACTTTCTC AATCCGTGAA GGTGGACGTA CTGTTGGTTC AGGT 8 94 



2) INFORMATION FOR SEQ ID NO: 1003 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 332 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1003 

AATGGATCCT GTATATGCAC AAAAATTAGG CGTTAACATA GATGAATTAC 50 

TATTATCACA GCCTGATACA GGGGAGCAAG GATTAGAAAT CGCGGAAGCA 100 

CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA CATGGGTGAC TCACACGTAG 2 00 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTCGTAAGCT TTCAGGAGCA 2 50 

ATCAACAAAT CAAAAACAAT TGCAATCTTT ATTAACCAAA TTCGTGAAAA 3 00 

AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AA 3 32 



2) INFORMATION FOR SEQ ID NO: 1004 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1004 

10 ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

15 TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 3 00 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 4 00 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 4 50 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

2 0 ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

TTATGCTGCC TTTGCAAATG GTGGCACTTA CTATAAACCA ATGTATATCC 600 

ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 650 

ACTCGTGCCA TGAAGGAAAC GACAGCCTAT ATGATGACCG ACATGATGAA 700 

AACAGTCTTG ACTTATGGAA CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 750 

2 5 CTCAAGCTGG TAAAACAGGA ACCTCTAACT ATACAGATGA GGAAGTTGAA 8 00 

AACCACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 8 50 

TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 9 00 

TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTATC AGAAGATACT CATCCAGAAG ACTGGACGAT 10 00 

3 0 GCCAGACGGA CTTTTCAGAA ACGGGGAATT TGTATTCAAA AATGGAGCTC 1050 

GCCCAATATG GACTGAACCC TCTACTCAAC AATCCTCAAC AGCTGAAAGT 1100 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1150 

AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 12 00 

CAAATACAAC CC 1212 

35 

2) INFORMATION FOR SEQ ID NO: 1005 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: StrR-02 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 10 05 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

55 TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 2 00 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 2 50 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 3 00 

60 GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 3 50 
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CAATATGCTC TTCAACAATC ACGAAATGTC 
TAAGGTCGGT CTAGATAGAG CTAAAACCTT 
ACTATCCAAG CATGCATTAT GCAAACGCCA 
TCTAATAAAC AATACGGAGC AAGTAGTGAA 
TGCCTTTGCC AACGGTGGAA TTTACCACAA 
TCGTCTTCAG TGACGGTAGT AAAAAAGAAT 
GCTATGAAAG AAACAACTGC TTACATGATG 
CTTGGCATAC GGAACTGGTC GTGGAGCCTA 
CTGGTAAGAC AGGTACTTCT AACTACACAG 
ATCAAGAACA CTGGCTATGT AGCTCCAGAT 
TCGTAAGTAT TCTATGGCTG TATGGACAGG 
CTATCGTTGG AGATGGTTTC CTAGTTGCAG 
ATAACGTATC TGTCTGAAGG AAGCAATCCA 
GGGGCTCTAC AGAAATGGAG AATTCGTATT 
CGTGGAGCTC ACCTGCTCCA CAACAACCCC 
TCATCATCAG ATAGTTCAAC TTCACAGTCT 
AAATAATAGT ACGACTACCA ATCCTAACAA 
CAACCCCTGA TC 
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ACAGCCGTTG AGACTTTGAA 4 00 

CCTTAATGGT CTTGGTATCG 450 

TTTCAAGTAA TACAACAGAA 500 

AAAATGG CTG CTGCTTATGC 550 

ACCCATGTAT ATCAATAAGG 60 0 

TTTCAGATGT AGGTACACGA 650 

ACCGAAATGA TGAAAACTGT 700 

TCTCCCATGG TTAGCGCAAG 750 

ATGATGAAAT TGAAAAACAC 800 

GAAATGTTTG TTGGTTATAC 850 

TTATTCGAAT CGTTTAACTC 90 0 

CTAAAGTTTA TCGCTCAATG 950 

GAGGATTGGA ATATACCAGA 100 0 

TAAAAATGGT GCTCGTTCTA 1050 

CATCAACTGA AAGTTCAAGC 1100 

AACTCAACCA CTCCAAGCAC 1150 

TAATACGCAA CAATCAAATA 12 00 

1212 



2) INFORMATION FOR SEQ ID NO: 1006 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1213 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1006 

ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 10 0 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 2 00 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 300 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 3 50 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 4 00 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 50 0 

ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

TTATGCTGCC TTTGCCAACG GTGGAATTTA CCACAAACCC ATGTATATCA 600 

ATAAGGTCGT CTTCAGTGAC GGTAGTAAAA AAGAATTTTC AGATGTAGGT 650 

ACACGAGCTA TGAAAGAAAC AACTGCTTAC ATGATGACCG AAATGATGAA 700 

AACTGTCTTG GCATACGGAA CTGGTCGTGG AGCCTATCTC CCATGGTTAG 750 

CGCAAGCTGG TAAGACAGGT ACTTCTAACT ACACAGATGA TGAAATTGAA 800 

AAACACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTGTC TGAAGGAAGC AATCCAGAGG ATTGGAATAT 1000 

AC C AGAGGGG CTCTACAGAA ATGGAGAATT CGTATTTAAA AATGGTGCTC 1050 

GTTCTACGTG GAGCTCACCT GCTCCACAAC AACCCCCATC AACTGAAAGT 1100 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAACT CAACCACTCC 1150 
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AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 
CAAATACAAC CCC 



1200 
1213 



2) INFORMATION FOR SEQ ID NO: 1007 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1007 

ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 2 00 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 2 50 

TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 3 00 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 3 50 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 4 00 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 

TTATGCTGCC TTTGCAAATG GTGGCACTTA CTATAAACCA ATGTATATCC 600 

ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 650 

ACTCGTGCCA TGAAGGAAAC GACAGCCTAT ATGATGACCG ACATGATGAA 700 

AACAGTCTTG ACTTATGGAA CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 750 

CTCAAGCTGG TAAAACAGGA ACCTCTAACT ATACAGATGA GGAAGTTGAA 8 00 

AACCACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTATC AGAAGATACT CATCCAGAAG ACTGGACGAT 10 0 0 

GCCAGACGGA CTTTTCAGAA ACGGGGAATT TGTATTCAAA AATGGAGCTC 1050 

GCCCAATATG GACTGAACCC TCTACTCAAC AATCCTCAAC AGCTGAAAGT 1100 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1150 

AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 12 0 0 

CAAATACAAC CCCTGATC 1218 



2) INFORMATION FOR SEQ ID NO: 1008 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1008 

GTAGACCAAG AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA 50 

ATACGTCTCT TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG 100 

ATGTTTCAAA TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA 150 

AGTAACGTTT CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG 2 00 

GGGTTCTGCT ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG 250 

GTGTTTATGA TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT 3 00 

CCGGGAACAA GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA 350 

TATTACTCTG CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG 40 0 

AGACTTTGAA TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT 450 

CTTGGTATCG ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA 500 

TACAACAGAA TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG 550 

CTGCTTATGC TGCCTTTGCA AATGGTGGCA CTTACTATAA ACCAATGTAT 600 

ATCCATAAAG TCGTCTTCAG TGATGGAAGT AAAAAAGAGT TCTCTAATGT 6 50 

CGGAACTCGT GCCATGAAGG AAACGACAGC CTATATGATG ACCGACATGA 700 

TGAAAACAGT CTTGACTTAT GGAACTGGGC GTGGAGCCTA TCTTCCTTGG 750 

CTTCCTCAAG CTGGTAAAAC AGGAACCTCT AACTATACAG ATGAGGAAGT 800 

TGAAAAC C AC ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG 850 

TTGGTTATAC TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT 900 

CGTTTAACTC CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA 950 

TCGCTCAATG ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA 1000 

CGATGCCAGA CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA 1050 

GCTCGCCCAA TATGGAC TGA ACCCTCTACT CAACAATCCT CAACAGCTGA 1100 

AAGTTCAAGC TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA 1150 

CTCCAAGCAC AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA 1200 

CAATCAAATA CAACCCCTGA TCA 1223 



2) INFORMATION FOR SEQ ID NO : 1009 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1214 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1009 

ACCAAGAAGC TCAAAAACAT CTGTGGGATA TCTACAACTC CGATCAATAC 50 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCATCTACGG TCGTAGATGT 100 

TTCAAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 150 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 200 

TCTGCTATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 250 

TTATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 3 00 

GAACAAGCAC ACCTGTCTAC AACTGGGATA GAGCATATTT CGGTAATATT 350 

ACTCTGCAAT ATGCTCTTCA ACAATCACGA AATGTCACAG CCGTTGAGAC 400 

TTTGAATAAG GTCGGTCTAG ATAGAGCTAA AACCTTCCTT AATGGTCTTG 450 

GTATCGACTA TCCAAGCATG CATTATGCAA ACGCCATTTC AAGTAATACA 500 

ACAGAATCTA ATAAACAATA CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC 550 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

TTATGCTGCC TTTGCAAATG GTGGCACTTA CTATAAACCA ATGTATATCC 600 

ATAAAGTCGT CTTCAGTGAT GGAAGTAAAA AAGAGTTCTC TAATGTCGGA 650 

ACTCGTGCCA TGAAGGAAAC GACAGCCTAT ATGATGACCG ACATGATGAA 700 

AACAGTCTTG ACTTATGGAA CTGGGCGTGG AGCCTATCTT CCTTGGCTTC 750 

5 CTCAAGCTGG TAAAACAGGA ACCTCTAACT ATACAGATGA GGAAGTTGAA 800 

AACCACATCA AGAACACTGG CTATGTAGCT CCAGATGAAA TGTTTGTTGG 850 

TTATACTCGT AAGTATTCTA TGGCTGTATG GACAGGTTAT TCGAATCGTT 900 

TAACTCCTAT CGTTGGAGAT GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 950 

TCAATGATAA CGTATCTATC AGAAGATACT CATC CAGAAG ACTGGACGAT 1000 

10 GCCAGACGGA CTTTTCAGAA ACGGGGAATT TGTATTCAAA AATGGAGCTC 1050 

GCCCAATATG GACTGAACCC TCTACTCAAC AATCCTCAAC AGCTGAAAGT 1100 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1150 

AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1200 

CAAATACAAC CCCT 1214 

15 

2) INFORMATION FOR SEQ ID NO: 1010 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1223 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
30 (B) STRAIN: StrR-07 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1010 

CCAAGAAGCT CAAAAACATC TGTGGGATAT CTACAACTCC GATCAATACG 50 

3 5 TCTCTTACCC TGACGATGAT TTGCAAGTCG CATCTACGGT CGTAGATGTT 100 

TCAAATGGTA AAGTCATCGC ACAACTTGGT GCTCGTCATC AAGCAAGTAA 150 

TGTTTCATTC GGTACCAACC AGGCCGTAGA AACCAATCGT GACTGGGGAT 2 00 

CATCAATGAA ACCAATCACT GACTATGCTC CCGCTTTAGA ATATGGAGTC 2 50 

TATGACTCTA CTGCTTCTAT TGTACATGAT GTCCCTTATA ACTATCCTGG 300 

4 0 CACTGATACT CCACTCTACA AC TGGGATC A TGTCTACTTT GGAAACATTA 3 50 

CAATCCAGTA TGCTCTTCAA CAATCACGAA ATGTCACAGC CGTTGAGACT 4 00 

TTGAATAAGG TCGGTCTAGA TAGAGCTAAA ACCTTCCTTA ATGGTCTTGG 450 

TATCGACTAT CCAAGCATGC ATTATGCAAA CGCCATTTCA AGTAACACAA 500 

CTGAATCCAA CAAAAAATAT GGTGCAAGTA GTGAAAAAAT GGCTGCTGCC 550 

45 TACGCTGCTT TTGCTAATGG TGGTATTTAT CACAAACCAA TGTATATCAA 600 

TAAAATCGTC TTTAGTGATG GTAGCGAAAA AGAATTTTCT GATGCTGGTA 650 

CACGAGCTAT GAAAGAGACT ACTGCCTATA TGATGACTGA AATGATGAAA 700 

ACTGTTTTAA CTTACGGAAC AGGACGTGGA GCCTACCTAC CATGGCTTCC 750 

ACAAGCAGGT AAGACAGGTA CTTCTAACTA TACTGACGAA GAAATTGAAA 800 

50 AGTATATCAA GAACACTGGT TACGTAGCTC CAGATGAAAT GTTTGTAGGG 850 

TATACCCGTA AATATGCAAT GGCTGTTTGG ACAGGATACT CAAATCGTCT 900 

AACTCCAATC ATCGGAGATG GTTTCCTTGT TGCTGGTAAA GTCTATCGTT 950 

CAATGATAAC TTACCTTTCT GAAGATGACC AACCTGGAGA TTGGACAATG 10 00 

CCAGATGGCT TGTATAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG 1050 

55 TTCTACGTGG AGCTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT 110 0 

CAAGCTCATC ATCAGATAGT TCAACTTCAC AGTCTAACTC AACCACTCCA 1150 

AGCACAAATA ATAGTACGAC TACCAATCCT AACAATAATA CGCAACAATC 12 00 

AAATACAACC CCTGATCAAC AAA 1223 



60 
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2) INFORMATION FOR SEQ ID NO: 1011 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 12 07 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1011 



15 



AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

20 TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 2 00 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 2 50 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 3 00 

GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 3 50 

25 CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 4 00 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 4 50 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

TGCCTTTGCA AATGGTGGCA CTTACTATAA ACCAATGTAT ATCCATAAAG 600 

30 TCGTCTTCAG TGATGGAAGT AAAAAAGAGT TCTCTAATGT CGGAACTCGT 650 

GCCATGAAGG AAACGACAGC CTATATGATG ACCGACATGA TGAAAACAGT 7 00 

CTTGACTTAT GGAACTGGGC GTGGAGCCTA TCTTCCTTGG CTTCCTCAAG 750 

CTGGTAAAAC AGGAACCTCT AACTATACAG ATGAGGAAGT TGAAAACCAC 800 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 850 

35 TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA CGATGCCAGA 1000 

CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA GCTCGCCCAA 1050 

TATGGACTGA ACCCTCTACT CAACAATCCT CAACAGCTGA AAGTTCAAGC 1100 

40 TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 12 00 

CAACCCC 12 07 



45 



55 



2) INFORMATION FOR SEQ ID NO: 1012 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1201 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1012 

562 
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AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

5 CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 200 

ATGAAACCAA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 250 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 300 

GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 3 50 

CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 400 

10 TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 4 50 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

TGCCTTTGCC AACGGTGGAA TTTACCACAA ACCCATGTAT ATCAATAAGG 60 0 

TCGTCTTCAG TGACGGTAGT AAAAAAGAAT TTTCAGATGT AGGTACACGA 650 

15 GCTATGAAAG AAACAACTGC TTACATGATG ACCGAAATGA TGAAAACTGT 70 0 

CTTGGCATAC GGAACTGGTC GTGGAGCCTA TCTCCCATGG TTAGCGCAAG 750 

CTGGTAAGAC AGGTACTTCT AACTACACAG ATGATGAAAT TGAAAAACAC 800 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 850 

TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 900 

2 0 CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TATCAGAAGA TACTCATCCA GAAGACTGGA CGATGCCAGA 1000 

CGGACTTTTC AGAAACGGGG AATTTGTATT CAAAAATGGA GCTCGTTCTA 1050 

CGTGGAACTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

25 AAATAATAGT ACGACTACCG ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

C 1201 



30 2) INFORMATION FOR SEQ ID NO: 1013 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 0 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-10 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1013 



TGTAGACCAA GAAGCTCAAA AACATCTGTG GGATATCTAC AACTCCGATC 50 

AATACGTCTC TTACCCTGAC GATGATTTGC AAGTCGCATC TACGGTCGTA 100 

GATGTTTCAA ATGGTAAAGT CATCGCACAA CTTGGTGCTC GTCATCAAGC 150 

AAGTAATGTT TCATTCGGTA CCAACCAGGC CGTAGAAACC AATCGTGACT 2 00 

50 GGGGATCATC AATGAAACCA ATCACTGACT ATGCTCCCGC TTTAGAATAT 250 

} GGAGTCTATG ACTCTACTGC TTCTATTGTA CATGATGTCC CTTATAACTA 300 

J" TCCTGGCACT GATACTCCAC TCTACAACTG GGATCATGTC TACTTTGGAA 3 50 

ACATTACAAT CCAGTATGCT CTTCAACAAT CACGAAATGT CACAGCCGTT 400 

* GAGACTTTGA ATAAGGTCGG TCTAGATAGA GCTAAAACCT TCCTTAATGG 450 

55 TCTTGGTATC GACTATCCAA GCATGCATTA TGCAAACGCC ATTTCAAGTA 500 

ACACAACTGA ATCCAACAAA AAATATGGTG CAAGTAGTGA AAAAATGGCT 550 

GCTGCCTACG CTGCTTTTGC TAATGGTGGT ATTTATCACA AACCAATGTA 600 

TATCAATAAA ATCGTCTTTA GTGATGGTAG CGAAAAAGAA TTTTCTGATG 650 

CTGGTACACG AGCTATGAAA GAGACTACTG CCTATATGAT GACTGAAATG 700 

60 ATGAAAACTG TTTTAACTTA CGGAACAGGA CGTGGAGCCT ACCTACCATG 750 
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GCTTCCACAA GCAGGTAAGA CAGGTACTTC TAACTATACT GACGAAGAAA 8 00 

TTGAAAAGTA TATCAAGAAC ACTGGCTACG TAGCTCCAGA TGAAATGTTT 850 

GTGGGTTATA CTCGTAAGTA TTCTATGGCT GTATGGACAG GTTATTCGAA 900 

TCGTTTAACT CCTATCGTTG GAGATGGTTT CCTAGTTGCA GCTAAAGTTT 950 

5 ATCGCTCTAT GATGACCTAC CTGTCTGAAG GAAGCAATCC AGAGGATTGG 1000 

AATATACCAG AGGGGCTCTA CAGAAATGGA GAATTCGTAT TTAAAAATGG 1050 

TGCTCGTTCT ACGTGGAGCT CACCTGCTCC ACAACAACCC CCATCAACTG 1100 

AAAGTTCAAG CTCATCATCA GATAGTTCAA CTTCACAGTC TAGCTCAACC 1150 

ACTCCAAGCA CAAATAATAG TACGACTACC AATCCTAACA ATAATACGCA 12 00 

10 ACAATCAAAT ACAACCCCTG 122 0 



15 



55 



2) INFORMATION FOR SEQ ID NO: 1014 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1199 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

2 0 (D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1014 

3 0 CAAAAACATC TGTGGGATAT TTACAATACA GACGAATACG TTGCCTATCC 50 

AGACGATGAA TTGCAAGTCG CTTCTACCAT TGTTGATGTT TCTAACGGTA 100 

AAGTCATTGC CCAGCTAGGA GCACGCCATC AGTCAAGTAA TGTTTCCTTC 150 

GGAATTAACC AAGCAGTAGA AACAAACCGC GACTGGGGAT CAACTATGAA 2 00 

ACCGATCACA GACTATGCTC CTGCCTTGGA GTACGGTGTC TACGATTCAA 250 

3 5 CTGCTACTAT CGTTCACGAT GAGCCCTATA ACTACCCTGG GACAAATACC 3 00 

CCTGTTTATA ACTGGGATAG GGGCTACTTT GGCAACATCA CCTTGCAATA 350 

CGCCCTGCAA CAATCGCGAA ACGTCCCAGC CGTGGAAACT CTAAACAAGG 4 00 

TCGGACTCAA CCGCGCCAAG ACTTTCCTAA ATGGTCTCGG AATCGACTAC 4 50 

CCAAGTATTC ACTACTCAAA TGCCATTTCA AGTAACACAA CCGAATCAGA 500 

40 CAAAAAATAT GGAGCAAGTA GTGAAAAGAT GGCTGCTGCT TACGCTGCCT 550 

TTGCAAATGG TGGAACTTAC TATAAACCAA TGTATATCCA TAAAGTCGTC 600 

TTTAGTGATG GGAGTGAAAA AGAGTTCTCT AATGTCGGAA CTCGTGCCAT 650 

GAAGGAAACG ACAGCCTATA TGATGACCGA. CATGATGAAA ACTGTCTTAG 70 0 

TATACGGAAT CGGACGTGGA GCCTACCTAC CTTGGCTTCC ACAAGCAGGT 750 

45 AAAACAGGTA CTTCTAACTA TACTGACGAA GAAATTGAAA AGTATATCAA 800 

GAACACTGGT TACGTAGCTC CAGATGAAAT GTTTGTAGGG TATACCCGCA 850 

AATATGCAAT GGCTGTATGG ACAGGCTATT CTAACCGTCT GACACCACTT 900 

GTAGGCGATG GCCTTACGGT CGCTGCTAAA GTTTACCGCT CTATGATGAC 950 

CTACCTGTCT GAAGGAAGCA ATCCAGAGGA TTGGAATATA CCAGAGGGGC 100 0 

50 TCTACAGAAA TGGAGAATTC GTATTTAAAA ATGGTGCTCG TTCTACGTGG 1050 

AACTCACCTG CTCCACAACA ACCCCCATCA ACTGAAAGTT CAAGCTCATC 1100 

ATCAGATAGT TCAACTTCAC AGTCTAGCTC AACCACTCCA AGCACAAATA 1150 

ATAGTACGAC TACCAATCCT AACAATAATA CGCAACAATC AAATACAAC 1199 



2) INFORMATION FOR SEQ ID NO: 1015 



(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 1211 bases 

564 
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40 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1015 



AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

15 TGGTAAAGTC ATCGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGTAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCTGCT 2 00 

ATGAAAC C AA TCACCGATTA TGCACCTGCC ATAGAATACG GTGTTTATGA 2 50 

TTCCACTGCA ACTATGGTTA ATGATATTCC TTATAACTAT CCGGGAACAA 3 00 

GCACACCTGT CTACAACTGG GATAGAGCAT ATTTCGGTAA TATTACTCTG 350 

2 0 CAATATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 4 00 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 4 50 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA TACAACAGAA 500 

TCTAATAAAC AATACGGAGC AAGTAGTGAA AAAATGGCTG CTGCTTATGC 550 

TGCCTTTGCC AACGGTGGAA TTTACCACAA ACCCATGTAT ATCAATAAGG 6 00 

2 5 TCGTCTTCAG TGACGGTAGT AAAAAAGAAT TTTCAGATGT AGGTACACGA 650 

GCTATGAAAG AAACAACTGC TTACATGATG ACCGAAATGA TGAAAACTGT 70 0 

CTTGGCATAC GGAACTGGTC GTGGAGCCTA TCTCCCATGG TTAGCGCAAG 750 

CTGGTAAGAC AGGTACTTCT AACTACACAG ATGATGAAAT TGAAAAACAC 8 00 

ATCAAGAACA CTGGCTATGT AGCTCCAGAT GAAATGTTTG TTGGTTATAC 8 50 

3 0 TCGTAAGTAT TCTATGGCTG TATGGACAGG TTATTCGAAT CGTTTAACTC 9 00 

CTATCGTTGG AGATGGTTTC CTAGTTGCAG CTAAAGTTTA TCGCTCAATG 950 

ATAACGTATC TGTCTGAAGG AAGCAATCCA GAGGATTGGA ATATACCAGA 10 00 

GGGGCTCTAC AGAAATGGAG AATTCGTATT TAAAAATGGT GCTCGTTCTA 1050 

CGTGGAGCTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

35 TCATCATCAG ATAGTTCAAC TTCACAGTCT AACTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 12 00 

CAACCCCTGA T 1211 



2) INFORMATION FOR SEQ ID NO: 1016 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1222 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-13 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1016 

GTAGACCAAG AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA 50 

ATACGTCTCT TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG 100 

ATGTTTCAAA TGGTAAAGTC ATCGCACAAC TTGGTGCTCG TCATCAAGCA 150 

60 AGTAATGTTT CATTCGGTAC CAACCAGGCC GTAGAAACCA ATCGTGACTG 2 00 

565 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GGGATCATCA 
GAGTCTATGA 
CCTGGCACTG 
CATTACAATC 
5 AGACTTTGAA 
CTTGGTATCG 
CACAACTGAA 
CTGCCTACGC 
ATCAATAAAA 

10 TGGTACACGA 
TGAAAACTGT 
CTTCCACAAG 
TGAAAAGTAT 
TGGGTTATAC 

15 CGTTTAACTC 
TCGCTCTATG 
ATATACCAGA 
GCTCGTTCTA 
AAGTTCAAGC 

2 0 CTCCAAGCAC 
CAATCAAATA 



ATGAAACCAA 
CTCTACTGCT 
ATACTCCACT 
CAGTATGCTC 
TAAGGTCGGT 
ACTATCCAAG 
TCCAACAAAA 
TGCTTTTGCT 
TCGTCTTTAG 
GCTATGAAAG 
TTTAACTTAC 
CAGGTAAGAC 
ATCAAGAACA 
TCGTAAGTAT 
CTATCGTTGG 
ATGACCTACC 
GGGGCTCTAC 
CGTGGAGCTC 
TCATCATCAG 
AAATAATAGT 
CAACCCCTGA 



TCACTGACTA 
TCTATTGTAC 
CTACAACTGG 
TTCAACAATC 
CTAGATAGAG 
CATGCATTAT 
AATATGGTGC 
AATGGTGGTA 
TGATGGTAGC 
AGACTACTGC 
GGAACAGGAC 
AGGTACTTCT 
CTGGCTACGT 
TCTATGGCTG 
AGATGGTTTC 
TGTCTGAAGG 
AGAAATGGAG 
ACCTGCTCCA 
ATAGTTCAAC 
ACGACTACCA 
TC 



TGCTCCCGCT 
ATGATGTCCC 
GATCATGTCT 
ACGAAATGTC 
CTAAAACCTT 
GCAAACGCCA 
AAGTAGTGAA 
TTTATCACAA 
GAAAAAGAAT 
CTATATGATG 
GTGGAGCCTA 
AACTATACTG 
AGCTCCAGAT 
TATGGACAGG 
CTAGTTGCAG 
AAGCAATC C A 
AATTCGTATT 
CAACAACCCC 
TTCACAGTCT 
ATCCTAACAA 



PCT/CAOO/01150 

TTAGAATATG 2 50 

TTATAACTAT 3 00 

ACTTTGGAAA 3 50 

ACAGCCGTTG 4 00 

CCTTAATGGT 450 

TTTCAAGTAA 500 

AAAATGGCTG 550 

ACCAATGTAT 60 0 

TTTCTGATGC 650 

ACTGAAATGA 700 

CCTACCATGG 750 

ACGAAGAAAT 800 

GAAATGTTTG 850 

TTATTCGAAT 90 0 

CTAAAGTTTA 950 

GAGGATTGGA 100 0 

TAAAAATGGT 1050 

CATCAACTGA 110 0 

AGCTCAACCA 1150 

TAATACGCAA 12 00 
1222 



25 2) INFORMATION FOR SEQ ID NO: 1017 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 122 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



35 



40 



45 



50 



55 



60 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1017 



TGTAGACCAA 
AATACGTTGC 
GATGTTTCTA 
AAGTAATGTT 
GGGGATCAAC 
GGTGTCTACG 
CCCTGGGACA 
ACATCACCTT 
GAAACTCTAA 
TCTCGGAATC 
ACACAACCGA 
GCTGCTTACG 
TATCCATAAA 
TCGGAACTCG 
ATGAAAACTG 
GTTAGCGCAA 
TTGAAAAACA 
GTTGGTTATA 
TCGTTTAACT 
ATCGCTCAAT 



GAAGCTCAAA 
CTATCCAGAC 
ACGGTAAAGT 
TCCTTCGGAA 
TATGAAACCG 
ATTCAACTGC 
AATACCCCTG 
GCAATACGCC 
ACAAGGTCGG 
GACTACCCAA 
ATCAGACAAA 
CTGCCTTTGC 
GTCGTCTTTA 
TGCCATGAAA 
TCCTGGCATA 
GCTGGTAAGA 
CATCAAGAAC 
CTCGTAAGTA 
CCTATCGTTG 
GATAACGTAT 



AACATCTGTG 
GATGAATTGC 
CATTGCCCAG 
TTAACCAAGC 
ATCACAGACT 
TACTATCGTT 
TTTATAACTG 
CTGCAACAAT 
ACTCAACCGC 
GTATTCACTA 
AAATATGGAG 
AAATGGTGGA 
GTGATGGGAG 
GAAACAACTG 
CGGAAGTGGT 
CAGGTACTTC 
ACTGGCTATG 
TTCTATGGCT 
GAGATGGTTT 
CTATCAAAAG 



GGATATTTAC 
AAGTCGCTTC 
CTAGGAGCAC 
AGTAGAAACA 
ATGCTCCTGC 
CACGATGAGC 
GGATAGGGGC 
CGCGAAACGT 
GCCAAGACTT 
CTCAAATGCC 
CAAGTAGTGA 
ACTTACTATA 
TGAAAAAGAG 
CTTACATGAT 
CGTGGAGCCT 
TAACTACACA 
TAGCTCCAGA 
GTATGGACAG 
CCTAGTTGCA 
ATACTCATCC 



AATACAGACG 50 

TACCATTGTT 100 

GCCATCAGTC 150 

AACCGCGACT 2 00 

CTTGGAGTAC 2 50 

CCTATAACTA 3 00 

TACTTTGGCA 3 50 

CCCAGCCGTG 4 00 

TCCTAAATGG 4 50 

ATTTCAAGTA 500 

AAAGATGGCT 550 

AACCAATGTA 600 

TTCTCTAATG 650 

GACCGAAATG 700 

ATCTCCCATG 750 

GATGATGAAA 8 00 

TGAAATGTTT 850 

GTTATTCGAA 900 

GCTAAAGTTT 950 

AGAAGACTGG 10 00 



566 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

ACGATGCCAG ACGGACTTTT 
AGCTCGTTCT ACGTGGAACT 
AAAGTTCAAG CTCATCATCA 
ACTCCAAGCA CAAATAATAG 
5 ACAATCAAAT ACAACCCCTG 



CAGAAACGGG GAATTTGTAT 
CACCTGCTCC ACAACAACCC 
GATAGTTCAA CTTCACAGTC 
TACGACTACC AATCCTAACA 
ATCAACAAA 



PCT/CA00/01150 
TCAAAAATGG 10 50 

CCATCAACTG 1100 
TAACTCAACC 1150 
ATAATACGCA 12 0 0 

1229 



2) INFORMATION FOR SEQ ID NO: 1018 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1225 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE; 
2 0 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1018 

25 ATGTAGACCA AGAGGCTCAA AAACGTCTGT GGGATATCTA CAACTCCGAT 50 

CAATACGTCT CTTACCCTGA CGATGATTTG CAAGTCGCAT CTACGGTCGT 100 

AGATGTTTCA AATGGTAAAG TCATCGCCCA ACTTGGAGCT CGTCACCAAG 150 

CAAGTAACGT TTCATTTGGT ACCAACCAAG CTGTGGAAAC CAACCGTGAC 200 

TGGGGATCAA GCATGAAACC AATCACTGAT TATGCCCCAG CCTTAGAATA 2 50 

30 TGGTGTATAT GATTCCACTG CAACTATGGT TAATGATATT CCTTATAACT 300 

ATCCGGGCAC AAGCACACCT GTCTACAACT GGGATCGAGC ATATTTTGGT 3 50 

AATATTAGCC TGCAATATGC CCTTCAACAA TCTCGTAACG TGCCTGCCGT 4 00 

TGAAACACTA AACAAGGTTG GTTTAGATAG AGCCAAAACT TTCCTAAATG 450 

GTTTGGGAAT TGACTATCCA AGTATTCACT ACTCAAATGC TATTTCAAGT 50 0 

35 AATACAACTG AATCTAGTAA ACAGTACGGG GCAAGCAGTG AGAAAATGGC 550 

TGCGGCTTAC GCTGCATTCG CTAATGGCGG TATTTACCAC AAACCAATGT 600 

ACATCAATAA AGTTGTCTTT AGTGATGGTA GCGAAAAAGA ATTTTCTGAT 650 

GCTGGTACAC GAGCTATGAA AGAGACTACT GCCTATATGA TGACTGAAAT 700 

GATGAAAACT GTTTTAACTT ACGGAACAGG ACGTGGAGCC TACCTACCAT 75 0 

4 0 GGCTTCCACA AGCAGGTAAG ACAGGTACTT CTAACTATAC TGACGACGAA 800 

ATTGAAAAGT ATATCAAGAA CACTGGCTAC GTAGCTCCAG ATGAAATGTT 850 

TGTGGGTTAT ACTCGTAAGT ATTCTATGGC TGTATGGACT GGATACTCAA 900 

ATCGTTTAAC TCCAATCATT GGAGATGGTT TCCTAGTTGC TGCCAAAGTT 950 

TATCGCTCAA TGATATCGTA TCTATCAGAA GATGACCATC CTGGAGATTG 1000 

45 GACAATGCCT GAGGGAGTAT ACAGAAGTGG AGAATTCGTA TTTAAAAATG 1050 

GTGCTCGTTC TACGTGGAGC TCCCCTGCTC CACAACAACC CCCATCAACT 1100 

GAAAGTTCAA GCTCATCATC AGATAGTTCA ACTTCACAGT CTAGCTTAAC 1150 

CACTCCAAGC ACAAATAATA GTACGACTAC CAATCCTAAC AACAATACGC 12 00 

AACAATCAAA TACAACCCCT GATCA 122 5 

50 

2) INFORMATION FOR SEQ ID NO: 1019 

55 - (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Streptococcus pneumoniae 
5 (B) STRAIN: StrR-01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1019 

GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA CTTCTTGGGA 50 

10 TCGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTAGGG AGTGT AT C C A 100 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAATCCTA TCTTAAAAAA 150 

GGCTATTCTC TAAATGACCG TGTTGGAACC TCCTATTTGG AAAAGCAATA 2 00 

TGAAGAGACC TTACAAGGAA AACGCTCGGT AAAAGAAATC CATCTGGATA 250 

AATATGGCAA TATGGAAAGC GTGGACACAA TTGAGGAAGG TAGTAAGGGA 3 00 

15 AACAATATCA AACTGACCAT TGATTTGGCC TTCCAAGATA GCGTGGATGC 350 

TTTGCTGAAA AGTTATTTCA ATTCCGAGCT AGGAAATGGT GGAGCTAAAT 4 00 

ATTCTGAAGG TGTCTATGCA GTCGCCCTTA ACCCAAAAAC AGGTGCTGTT 450 

TTATCCATGT CAGGGATCAA ACATGACCTG AAAACGGGAG AGTTGACTCC 500 

TGATTCCTTG GGAACGGTAA CCAATGTCTT TGTCCCAGGT TCGGTTGTCA 550 

2 0 AGGCTGCGAC CATCAGCTCA GGTTGGGAAA ATGGTGTTTT ATCAGGAAAC 600 

CAAACCTTAA CAGATCAGCC TATTGTTTTC CAAGGTTCAG CTCCAATTTA 650 

TTCTTGGTAT AAATTGGCAT ATGGATCTTT TCCTATTACA GCTGTGGAAG 700 

CCTTGGAGTA TTCATCCAAT GCTTACATGG TTCAAACCGC TCTTGGAATC 750 

ATGGGCCAGA CCTATCAACC AAATATGTTT GTTGGAACCA GCAATTTGGA 800 

25 AACAGCTATG GGAAAACTTC GTGCGACCTT TGGCGAATAT GGCTTGGGGG 850 

CTGCGACCGG AATTGACCTA CCAGATGAAT CTACTGGATT TGTTCCCAAA 900 

GAGTATAGCT TTGCTAATTA CATCACCAAT TCCTTTGGGC AGTTTGATAA 950 

CTATACGCCC ATGCAGTTGG CTCAGTATGT AGCAACTATT GCAAATAATG 1000 

GTGTTCGTGT GGCTCCTCGT ATTGTTGAAG GCATTTATGG TAATAATGAT 1050 

3 0 AAGGGAGGAC TGGGTGACTT GATTCAGCAA CTGCAACCGA CAGAGATGAA 1100 

TAAGGTCAAT ATATCCGACT CCGATATGAG CATCTTGCAC CAAGGTTTTT 1150 

ATCAGGTTGC CCATGGTACT AGTGGATTGA CAACTGGACG TGCCTTTTCA 12 00 

AATGGTGCCT TGGTATCCAT TAGCGGAAAA ACAGGTACAG CCGAAAGCTA 12 50 

TGTGGCAGAT GGTCAGCAAG CAACCAATAC CAATGCGGTG GCCTATGCCC 13 00 

35 CATCTGATAA TCCCCAAATC GCTGTCGCAG TGGTCTTTCC TCATAATACC 13 50 

AATCTAACAA ATGGTGTAGG ACCTTCCATT GCGCGTGACA TTATCAATCT 1400 

GTATCAAAAA TACCATCCAA TGAACTAGAA AGGAAATTA 143 9 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1020 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1441 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-02 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1020 

GCCTCTATTT CAAAGGAGAT GCCTGGCATT AGTATTTCTA CTTCTTGGGA 50 

TCGAAAGGTT TTGGAAACTT CCCTTTCTTC TATAGTAGGG AGTGTATCCA 100 

GTGAAAAAGC TGGTCTCCCA GCGGAAGAAG CAGAATCCTA TCTTAAAAAA 150 

60 GGCTATTCTC TAAATGACCG TGTTGGAACC TCCTATTTGG AAAAGCAATA 200 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TGAAGAGACC 
AATATGGCAA 
AACAATATCA 
TTTGCTGAAA 
5 ATTCTGAAGG 
TTATCCATGT 
TGATTCCTTG 
AGGCTGCGAC 
CAAACCTTAA 

10 TTCTTGGTAT 
CCTTGGAGTA 
ATGGGCCAGA 
AACAGCTATG 
CTGCGACCGG 

15 GAGTATAGCT 
CTATACGCCC 
GTGTTCGTGT 
AAGGGAGGAC 
TAAGGTCAAT 

2 0 ATCAGGTTGC 
AATGGTGCCT 
TGTGGCAGAT 
CATCTGATAA 
AATCTAACAA 

25 GTATCAAAAA 



TTACAAGGAA 
TATGGAAAGC 
AACTGACCAT 
AGTTATTTCA 
TGTCTATGCA 
CAGGGATCAA 
GGAACGGTAA 
CATCAGCTCA 
CAGATCAGCC 
AAATTGGCAT 
TTCATCCAAT 
CCTATCAACC 
GGAAAACTTC 
AATTGACCTA 
TTGCTAATTA 
ATGCAGTTGG 
GGCTCCTCGT 
TGGGTGACTT 
ATATCCGACT 
CCATGGTACT 
TGGTATCCAT 
GGTCAGCAAG 
TCCCCAAATC 
ATGGTGTAGG 
TACCATCCAA 



AACGCTCGGT 
GTGGACACAA 
TGATTTGGCC 
ATTCCGAGCT 
GTCGCCCTTA 
ACATGACCTG 
CCAATGTCTT 
GGTTGGGAAA 
TATTGTTTTC 
ATGGATCTTT 
GCTTACATGG 
AAATATGTTT 
GTGCGACCTT 
CCAGATGAAT 
CATCACCAAT 
CTCAGTATGT 
ATTGTTGAAG 
GATTCAGCAA 
CCGATATGAG 
AGTGGATTGA 
TAGCGGAAAA 
CAACCAATAC 
GCTGTCGCAG 
ACCTTCCATT 
TGAACTAGAA 



AAAAGAAATC 
TTGAGGAAGG 
TTCCAAGATA 
AGGAAATGGT 
ACCCAAAAAC 
AAAACGGGAG 
TGTCCCAGGT 
ATGGTGTTTT 
CAAGGTTCAG 
TCCTATTACA 
TTCAAACCGC 
GTTGGAACCA 
TGGCGAATAT 
CTACTGGATT 
TCCTTTGGGC 
AGCAACTATT 
GCATTTATGG 
CTGCAACCGA 
CATCTTGCAC 
CAACTGGACG 
ACAGGTACAG 
CAATGCGGTG 
TGGTCTTTCC 
GCGCGTGACA 
AGGAAATTAT 



PCT/CA00/01150 

CATCTGGATA 2 50 

TAGTAAGGGA 3 00 

GCGTGGATGC 3 50 

GGAGCTAAAT 4 00 

AGGTGCTGTT 4 50 

AGTTGACTCC 500 

TCGGTTGTCA 550 

ATCAGGAAAC 6 00 

CTCCAATTTA 650 

GCTGTGGAAG 700 

TCTTGGAATC 750 

GCAATTTGGA 800 

GGCTTGGGGG 850 

TGTTCCCAAA 900 

AGTTTGATAA 950 

GCAAATAATG 1000 

TAATAATGAT 1050 

CAGAGATGAA 1100 

CAAGGTTTTT 1150 

TGCCTTTTCA 12 00 

CCGAAAGCTA 12 50 

GCCTATGCCC 13 0 0 

TCATAATACC 13 50 

TTATCAATCT 14 00 

G 1441 



30 



2) INFORMATION FOR SEQ ID NO: 1021 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 96 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
4 0 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1021 

4 5 TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 2 00 

AATATGAAGA GAC CTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

50 GATAAATATG GCAATATGGA AAGCGTGGAC ACAATTGAGG AAGGTAGTAA 3 00 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 3 50 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 4 00 

AAATATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 450 

TGTTTTATCC ATGTCAGGGA TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

55 CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

GTCAAGGCTG CGACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

60 AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 
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TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATTCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

5 AATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 12 00 

TTCAAATGGT GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 12 50 

10 GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 13 0 0 

GCCCCATCTG ATAATCCCCA AATCGCTGTC GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATT 13 96 



15 



25 



2) INFORMATION FOR SEQ ID NO: 1022 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 142 8 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

3 0 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1022 

AAAGGAGATG CCTGGCATTA GTATTTCTAC TTCTTGGGAT CGAAAGGTTT 50 
TGGAAACTTC CCTTTCTTCT ATAGTAGGGA GTGTATCCAG TGAAAAAGCT 100 
GGTCTCCCAG CGGAAGAAGC AGAATCCTAT CTTAAAAAAG GCTATTCTCT 150 
35 AAATGACCGT GTTGGAACCT CCTATTTGGA AAAGCAATAT GAAGAGACCT 2 00 

TACAAGGAAA ACGCTCGGTA AAAGAAATCC ATCTGGATAA ATATGGCAAT 2 50 

ATGGAAAGCG TGGACACAAT TGAGGAAGGT AGTAAGGGAA ACAATATCAA 3 00 

ACTGACCATT GATTTGGCCT TCCAAGATAG CGTGGATGCT TTGCTGAAAA 3 50 

GTTATTTCAA TTCCGAGCTA GGAAATGGTG GAGCTAAATA TTCTGAAGGT 4 00 

4 0 GTCTATGCAG TCGCCCTTAA CCCAAAAACA GGTGCTGTTT TATCCATGTC 4 50 

AGGGATCAAA CATGACCTGA AAACGGGAGA GTTGACTCCT GATTCCTTGG 500 
GAACGGTAAC CAATGTCTTT GTCCCAGGTT CGGTTGTCAA GGCTGCGACC 550 
ATCAGCTCAG GTTGGGAAAA TGGTGTTTTA TCAGGAAACC AAACCTTAAC 60 0 

AGATCAGCCT ATTGTTTTCC AAGGTTCAGC TCCAATTTAT TCTTGGTATA 650 

45 AATTGGCATA TGGATCTTTT CCTATTACAG CTGTGGAAGC CTTGGAGTAT 700 
TCATCCAATG CTTACATGGT TCAAACCGCT CTTGGAATCA TGGGCCAGAC 750 
CTATCAACCA AATATGTTTG TTGGAACCAG CAATTTGGAA ACAGCTATGG 800 
GAAAACTTCG TGCGACCTTT GGCGAATATG GCTTGGGGGC TGCGACCGGA 850 
ATTGACCTAC CAGATGAATC TACTGGATTT GTTCCCAAAG AGTATAGCTT 900 

50 TGCTAATTAC ATCACCAATT CCTTTGGGCA GTTTGATAAC TATACGCCCA 950 

TGCAGTTGGC TCAGTATGTA GCAACTATTG CAAATAATGG TGTTCGTGTG 1000 

GCTCCTCGTA TTGTTGAAGG CATTTATGGT AATAATGATA AGGGAGGACT 1050 

GGGTGACTTG ATTCAGCAAC TGCAACCGAC AGAGATGAAT AAGGTCAATA 1100 

TATCCGACTC CGATATGAGC ATCTTGCACC AAGGTTTTTA TCAGGTTGCC 1150 

55 CATGGTACTA GTGGATTGAC AACTGGACGT GCCTTTTCAA ATGGTGCCTT 12 00 

GGTATCCATT AGCGGAAAAA CAGGTACAGC CGAAAGCTAT GTGGCAGATG 1250 

GTCAGCAAGC AACCAATACC AATGCGGTGG CCTATGCCCC ATCTGATAAT 13 00 

CCCCAAATCG CTGTCGCAGT GGTCTTTCCT CATAATACCA ATCTAACAAA 13 50 

TGGTGTAGGA CCTTCCATTG CGCGTGACAT TATCAATCTG TATCAAAAAT 14 00 

60 ACCATCCAAT GAACTAGAAA GGAAATTA 1428 
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PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 1023 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1442 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1023 

TGCCTCTATT TCAAAGGAGA TGCCTGGCAT TAGTATTTCT ACTTCTTGGG 50 
ATCGAAAGGT TTTGGAAACT TCCCTTTCTT CTATAGTAGG GAGTGT AT CC 100 
AGTGAAAAAG CTGGTCTCCC AGCGGAAGAA GCAGAATCCT ATCTTAAAAA 150 
AGGCTATTCT CTAAATGACC GTGTTGGAAC CTCCTATTTG GAAAAGCAAT 20 0 

ATGAAGAGAC CTTACAAGGA AAACGCTCGG TAAAAGAAAT CCATCTGGAT 250 
AAATATGGCA ATATGGAAAG CGTGGACACA ATTGAGGAAG GTAGTAAGGG 3 00 

AAACAATATC AAACTGACCA TTGATTTGGC CTTCCAAGAT AGCGTGGATG 350 
CTTTGCTGAA AAGTTATTTC AATTCCGAGC TAGGAAATGG TGGAGCTAAA 4 00 

TATTCTGAAG GTGTCTATGC AGTCGCCCTT AACCCAAAAA CAGGTGCTGT 4 50 

TTTATCCATG TCAGGGATCA AACATGACCT GAAAACGGGA GAGTTGACTC 5 00 

CTGATTCCTT GGGAACGGTA ACCAATGTCT TTGTCCCAGG TTCGGTTGTC 550 
AAGGCTGCGA CCATCAGCTC AGGTTGGGAA AATGGTGTTT TATCAGGAAA 600 
CCAAACCTTA ACAGATCAGC CTATTGTTTT CCAAGGTTCA GCTCCAATTT 650 
ATTCTTGGTA TAAATTGGCA TATGGATCTT TTCCTATTAC AGCTGTGGAA 70 0 

GCCTTGGAGT ATTCATCCAA TGCTTACATG GTTCAAACCG CTCTTGGAAT 750 
CATGGGCCAG ACCTATCAAC CAAATATGTT TGTTGGAACC AGCAATTTGG 8 00 

AAACAGCTAT GGGAAAACTT CGTGCGACCT TTGGCGAATA TGGCTTGGGG 850 
GCTGCGACCG GAATTGACCT ACCAGATGAA TCTACTGGAT TTGTTCCCAA 900 
AGAGTATAGC TTTGCTAATT ACATCACCAA TTCCTTTGGG CAGTTTGATA 950 

ACTATACGCC CATGCAGTTG GCTCAGTATG TAGCAACTAT TGCAAATAAT 10 00 

GGTGTTCGTG TGGCTCCTCG TATTGTTGAA GGCATTTATG GTAATAATGA 1050 

TAAGGGAGGA CTGGGTGACT TGATTCAGCA ACTGCAACCG ACAGAGATGA 1100 

ATAAGGTCAA TATATCCGAC TCCGATATGA GCATCTTGCA CCAAGGTTTT 1150 

TATCAGGTTG CCCATGGTAC TAGTGGATTG ACAACTGGAC GTGCCTTTTC 12 00 

AAATGGTGCC TTGGTATCCA TTAGCGGAAA AACAGGTACA GCCGAAAGCT 1250 

ATGTGGCAGA TGGTCAGCAA GCAACCAATA CCAATGCGGT GGCCTATGCC 13 00 

CCATCTGATA ATCCCCAAAT CGCTGTCGCA GTGGTCTTTC CTCATAATAC 13 50 

CAATCTAACA AATGGTGTAG GACCTTCCAT TGCGCGTGAC ATTATCAATC 14 00 

TGTATCAAAA ATACCATCCA ATGAACTAGA AAGGAAATTA TG 1442 



2) INFORMATION FOR SEQ ID NO: 1024 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1445 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1024 

TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 
GATCGAAAGG TTTTGGAAAC TTCCCTTTCT TCTATAGTAG GGAGTGTATC 100 

10 CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAATCC TATCTTAAAA 150 
AAGGCTATTC TCTAAATGAC CGTGTTGGAA CCTCCTATTT GGAAAAGCAA 2 00 

TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 2 50 

TAAATATGGC AATATGGAAA GCGTGGACAC AATTGAGGAA GGTAGTAAGG 3 00 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 3 50 

15 GCTTTGCTGA AAAGTTATTT CAATTCCGAG CTAGGAAATG GTGGAGCTAA 4 00 

ATATTC TGAA GGTGTCTATG CAGTCGCCCT TAACCCAAAA ACAGGTGCTG 4 50 

TTTTATCCAT GTCAGGGATC AAACATGACC TGAAAACGGG AGAGTTGACT 500 
CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTCCCAG GTTCGGTTGT 550 
CAAGGCTGCG ACCATCAGCT CAGGTTGGGA AAATGGTGTT TTATCAGGAA 600 

20 ACCAAACCTT AACAGATCAG CCTATTGTTT TCCAAGGTTC AGCTCCAATT 650 
TATTCTTGGT ATAAATTGGC ATATGGATCT TTTCCTATTA CAGCTGTGGA 7 00 

AGCCTTGGAG TATTCATCCA ATGCTTACAT GGTTCAAACC GCTCTTGGAA 750 
TCATGGGCCA GACCTATCAA CCAAATATGT TTGTTGGAAC CAGCAATTTG 800 
GAAACAGCTA TGGGAAAACT TCGTGCGACC TTTGGCGAAT ATGGCTTGGG 850 

2 5 GGCTGCGACC GGAATTGACC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 
AAGAGTATAG CTTTGCTAAT TACATCACCA ATTCCTTTGG GCAGTTTGAT 950 

AACTATACAC CCATGCAGTT GGCTCAGTAT GTAGCAACTA TTGCAAATAA 10 00 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 10 50 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

30 AATAAGGTCA ATATATCCGA CTCCGATATG AGCATCTTGC ACCAAGGTTT 1150 

TTATCAGGTT GCCCATGGTA CTAGTGGATT GACAACTGGA CGTGCCTTTT 12 00 

CAAATGGTGC CTTGGTATCC ATTAGCGGAA AAACAGGTAC AGCCGAAAGC 12 50 

TATGTGGCAG ATGGTCAGCA AGCAACCAAT ACCAATGCGG TGGCCTATGC 13 00 

CCCATCTGAT AATCCCCAAA TCGCTGTCGC AGTGGTCTTT CCTCATAATA 13 50 

35 CCAATCTAAC AAATGGTGTA GGACCTTCCA TTGCGCGTGA CATTATCAAT 14 00 

CTGTATCAAA AATACCATCC AATGAACTAG AAAGGAAATT ATGCT 1445 



40 2) INFORMATION FOR SEQ ID NO: 1025 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1441 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-07 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1025 



TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATCGAAAGA TTTTGGAAAC TTCCCTTTCT TCTATAGTAG GGAGTGTATC 100 

CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAATCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTTGGAA CCTCGTATTT GGAAAAGCAA 200 

60 TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 2 50 
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TAAATATGGC 
GAAACAATAT 
GCTTTGCTGA 
GTATTCTGAG 
TTTTGTCTAT 
CCTGATTCCT 
CAAGGCTGCG 
ACCAAACCTT 
TATTCTTGGT 
AGCCTTGGAG 
TCATGGGCCA 
GAAACAGCTA 
GGCTGCGACC 
AAGACTATAG 
AACTATACGC 
TGGTGTTCGT 
ATAAGGGAGG 
AATAAGGTCA 
TTACCAAGTA 
CAGATGGCGC 
TATGTAGCTG 
TCCAACAGAA 
CCAATTTAAC 
TTATATAACC 



AATATGGAAA 
CAAACTGACC 
AAAGTTATTT 
GGTGTGTATG 
GTCAGGACTC 
TGGGAACGGT 
ACCATCAGCT 
AACAGATCAG 
ATAAATTGGC 
TATTCATCCA 
GACCTATCAA 
TGGGAAAACT 
GGAATTGACC 
CTTTGCTAAT 
CCATGCAGTT 
GTGGCTCCTC 
ACTGGGTGAC 
ATATATCCGA 
TCGCATGGAA 
CACTGTTTCT 
GTGGTCAAGA 
AATCCTCAAA 
CAAAAATGTT 
AACACCATCC 



GCGTGGACAC 
ATTGATTTGG 
CAATTCCGAG 
CAGTCGCCCT 
AAACATGACC 
AACCAATGTC 
CTGGCTGGGA 
CCTATTGTTT 
ATATGGATCT 
ATGCTTACAT 
CCAAATATGT 
TCGTGCGACC 
TACCAGATGA 
TACATCACCA 
GGCTCAGTAT 
GTATTGTTGA 
TTGATTCAGC 
CTCCGATATG 
CTAGTCCCCT 
ATCAGTGGTA 
AGCTAATAAT 
TTGCAGTTGC 
GGGCCAGCAA 
AATGAATTAG 



AATTGAGGAA 
CCTTCCAAGA 
CTAGGAAATG 
TAACCCCAAA 
TGAAAACGGG 
TTTGTCCCAG 
AAATGGTGTT 
TCCAAGGTTC 
TTTCCTATTA 
GGTTCAAACC 
TTGTTGGAAC 
TTTGGCGAAT 
ATCTACTGGA 
ATGCCTTTGG 
GTAGCAACTA 
AGGCATTTAT 
AACTGCAACC 
AGTATCTTGC 
TACGACAGGA 
AGACCGGTAC 
ACCAATGCCG 
AGTAGTCTTT 
TTGCTCGCGA 
AAAGGAAGCC 



PCT/CAOO/01150 

GGTAGTAAGG 3 00 

TAGCGTGGAT 3 50 

GTGGAGCCAA 4 00 

ACAGGTGCTG 450 

AGAGTTGACT 500 

GTTCGGTTGT 550 

TTATCAGGAA 600 

AGCTCCAATT 650 

CAGCTGTGGA 700 

GCTCTTGGAA 750 

CAGCAATTTG 8 00 

ATGGCTTGGG 850 

TTTGTTCCCA 900 

GCAGTTTGAT 950 

TTGCAAATGA 10 00 

GGTAATAATG 1050 

GACAGAGATG 1100 

ACCAAGGATT 1150 

CGGGCGTTTT 12 00 

AGGTGAAAGC 12 50 

TGGCCTATGC 13 00 

CCTCATAATA 13 50 

CATTATCAAT 14 00 

A 1441 



2) INFORMATION FOR SEQ ID NO: 1026 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1443 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-08 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1026 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

45 GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 2 00 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 

GATAAATATG GCAATATGGA AAGCGTGGAC ACAATTGAGG AAGGTAGTAA 3 00 

50 GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 3 50 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 4 00 

AAATATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 4 50 

TGTTTTATCC ATGTCAGGGA TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

55 GTCAAGGCTG CGACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 7 00 

GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

60 TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 
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GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 90 0 

CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATTCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

AATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

5 TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 110 0 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 12 0 0 

TTCAAATGGT GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 13 00 

10 GCCCCATCTG ATAATCCCCA AATCGCTGTC GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 14 00 

ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAA TTA 1443 



15 



2) INFORMATION FOR SEQ ID NO: 1027 



(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 1443 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1027 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT TGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

35 AAAAGGCTAT TCTCTAAATG ATCGTGTAGG AACCTCCTAT TTGGAAAAGC 2 00 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 2 50 

GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 3 00 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCTTTCCAA GATAGCGTGG 3 50 

ATGCTTTACT GAAAAGTTAT TTCAATTCCG AGCTAGAAAA TGGTGGAGCC 4 00 

4 0 AAGTATTCTG AAGGTGTCTA TGCAGTCGCC CTTAACCCAA AAACAGGTGC 4 50 

TGTTTTGTCT ATGTCAGGGA TTAAACATGA CTTGAAAACG GGAGATTTAA 500 

CACCTGATTC CTTGGGAACA GTAACCAATG TCTTTGTCCC GGGTTCTGTT 550 

GTCAAGGCGG CGACCATCAG CTCTGGTTGG GAGAATGGAG TCTTATCAGG 600 

AAATCAGACC TTGACAGACC AACCGATTGT CTTCCAAGGT TCAGCTCCGA 650 

45 TTAATTCTTG GTACACTCAG GCTTACGATT CATTTCCGAT TACAGCGGTG 700 

GAAGCCTTGG AGTATTCTTC TAATGCCTAT ATGGTCCAAA CAGCTCTAGG 750 

TCTTATGGGG CAGACCTACC AACCCAATAT GTTTGTCGGC ACCAGCAATC 800 

TAGAGTCTGC TATGGGGAAA TTGCGTTCAA CCTTTGGTGA ATATGGCTTG 850 

GGCTCTGCGA CTGGGATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

50 CAAAGAGTAT AGCTTTGCTA ATTACATTAC TAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCGATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

GATGGTGTTC GTGTGGCTCC TCGTATTGTG GAAGGCATTT ATGGCAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGTATCTT GCACCAAGGT 1150 

55 TTTTATCAGG TTGCTCATGG GACTAG CGGA TTGACAACAG GTCGTGCCTT 12 00 

TTCCAATGGT GCAGCTGTAT CCATTAGTGG AAAAACAGGT ACCGCCGAAA 12 50 

GTTATGTAGC AGGTGGCCAA GAAGCCAACA ATACTAATGC TGTAGCCTAT 13 00 

GCACCATCAG ATAATCCTCA AATAGCTGTT GCTGTTGTCT TCCCTCATAA 13 50 

CACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GATATTATCA 14 00 

60 ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAT TTA 1443 
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2) INFORMATION FOR SEQ ID NO: 1028 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 102 8 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACCTCTT 50 

GGGATAGAAA GGTTTTGGAA ACTTCTCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG GACTTCTTAC CTGGAAAAAC 2 00 

AATACGAGGA AACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 2 50 

GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 3 00 

GGGAAACAAT ATCAAACTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 3 50 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCT 4 00 

AAGTATTCTG AAGGCGTGTA TGCAGTCGCC CTTAACCCCA AAACAGGTGC 450 

TGTTTTGTCT ATGTCAGGAC TCAAACATGA CCTGAAAACG GGAGACTTGA 500 

CGCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGGTCAGTA 550 

GTTAAGGCCG CTACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 

GAAGCCTTGG AGTATTCATC TAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 

AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 

TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 

GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 

CAAAGAGTAT AGCTTTGCTA ATTACATTAC TAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCGATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

GATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 110 0 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGCATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCCCATGG TACTAGTGGA TTGACAACTG GACGTGCCTT 12 00 

TTCAAATGGC GCCTTGGTAT CCATTAGCGG AAAAACAGGT ACAGCCGAAA 1250 

GCTATGTGGC AGATGGTCAG CAAGCAACCA ATACCAATGC GGTGGCCTAT 13 00 

GCCCCATCTG ATAATCCCCA AATCGCTGTT GCAGTGGTCT TTCCTCATAA 1350 

TACCAATCTA ACAAATGGTG TAGGACCTTC CATTGCGCGT GACATTATCA 14 00 

ATCTGTATCA AAAATACCAT CCAATGAACT AGAAAGGAAA TTATGC 1446 



2) INFORMATION FOR SEQ ID NO: 1029 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1423 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 9 

TTGCCTCTAT TTCAAAGGAG ATGCCTGGCA TTAGTATTTC TACTTCTTGG 50 

GATAGAAAGG TTTTGGAAAC TTCCCTTTCT TCTATAGTTG GGAGTGTATC 10 0 

10 CAGTGAAAAA GCTGGTCTCC CAGCGGAAGA AGCAGAAGCC TATCTTAAAA 150 

AAGGCTATTC TCTAAATGAC CGTGTAGGAA CCTCCTATTT GGAAAAGCAA 2 00 

TATGAAGAGA CCTTACAAGG AAAACGCTCG GTAAAAGAAA TCCATCTGGA 2 50 

TAAATATGGC AATATGGAAA GCGTGGATAC AATTGAGGAA GGTAGTAAGG 3 00 

GAAACAATAT CAAACTGACC ATTGATTTGG CCTTCCAAGA TAGCGTGGAT 3 50 

15 GCTTTACTGA AAAGTTATTT CAATTCTGAG CTAGAAAATG GTGGAGCCAA 4 00 

GTATTCTGAA GGTGTCTATG CAGTCGCCCT TAACCCAAAA ACAGGTGCGG 4 50 

TTTTGTCTAT GTCAGGGATT AAACATGACT TGAAAACGGG AGAGTTGACG 500 

CCTGATTCCT TGGGAACGGT AACCAATGTC TTTGTTCCAG GTTCGGTTGT 550 

CAAGGCGGCG ACCATCAGCT CAGGTTGGGA AAATGGAGTC TTGTCAGGAA 60 0 

2 0 ACCAGACCTT GACAGACCAG TCCATTGTCT TCCAAGGTTC AGCTCCCATC 650 

AATTCTTGGT ATACTCAGGC TTACGGTTCA TTCCCTATCA CAGCGGTCCA 700 

AGCTCTGGAG TATTCATCCA ATGCTTATAT GGTCCAAACA GCCTTAGGTC 750 

TTATGGGGCA GACCTATCAA CCCAATATGT TTGTCGGCAC CAGCAATCTA 800 

GAGTCTGCTA TGGGTAAATT GCGTTCAACC TTTGGCGAAT ATGGCTTGGG 850 

2 5 GGCTGCGACT GGGATTGATC TACCAGATGA ATCTACTGGA TTTGTTCCCA 900 

AAGACTATAA CTTTGCCAAT TTCATTACCA ATGCCTTTGG GCAGTTTGAT 950 

AACTATACCC CAATGCAATT GGCTCAGTAT GTAGCAACTA TTGCAAATGA 1000 

TGGTGTTCGT GTGGCTCCTC GTATTGTTGA AGGCATTTAT GGTAATAATG 1050 

ATAAGGGAGG ACTGGGTGAC TTGATTCAGC AACTGCAACC GACAGAGATG 1100 

3 0 AATAAGGTCA ATATATCCGA CTCCGATATG AGTGTCTTGC ACCAAGGTTT 1150 

TTATCAGGTT GCTCATGGGA CTAGTGGGTT GACAACTGGC CGTGCCTTTT 12 00 

CAAATGGTGC CTTGGTATCC ATTAGCGGAA AAACGGGTAC AGCCGAAAGC 12 50 

TATGTGGCAG ATGGTCAGGA AGCAACCAAT ACCAATGCGG TGGCCTATGC 13 00 

CCCATCTGAT AATCCCCAAA TCGCTGTCGC AGTGGTCTTT CCTCATAATA 13 50 

3 5 CCAATCTAAC AAATGGTGTA GGACCTTCCA TTGCGCGTGA CATTATCAAT 1400 

CTGTATCAAA AATACCATCC AAT 1423 



40 2) INFORMATION FOR SEQ ID NO: 1030 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1447 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 0 



TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 50 

GGGATCGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT AGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAT CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCCTAT TTGGAAAAGC 2 00 

60 AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 250 
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GATAAATATG 
GGGAAACAAT 
ATGCTTTGCT 
AAATATTCTG 
TGTTTTATCC 
CTCCTGATTC 
GTCAAGGCTG 
AAACCAAACC 
TTTATTCTTG 
GAAGCCTTGG 
AATCATGGGC 
TGGAAACAGC 
GGGGCTGCGA 
CAAAGAGTAT 
ATAACTATAC 
AATGGTGTTC 
TGATAAGGGA 
TGAATAAGGT 
TTTTATCAGG 
TTCAAATGGT 
GCTATGTGGC 
GCCCCATCTG 
TACCAATCTA 
ATCTGTATCA 



GCAATATGGA 
ATCAAACTGA 
GAAAAGTTAT 
AAGGTGTCTA 
ATGTCAGGGA 
CTTGGGAACG 
CGACCATCAG 
TTAACAGATC 
GTATAAATTG 
AGTATTCATC 
CAGACCTATC 
TATGGGAAAA 
CCGGAATTGA 
AGCTTTGCTA 
GCCCATGCAG 
GTGTGGCTCC 
GGACTGGGTG 
CAATATATCC 
TTGCCCATGG 
GCCTTGGTAT 
AGATGGTCAG 
ATAATCCCCA 
ACAAATGGTG 
AAAATACCAT 



AAGCGTGGAC 
CCATTGATTT 
TTCAATTCCG 
TGCAGTCGCC 
TCAAACATGA 
GTAACCAATG 
CTCAGGTTGG 
AGCCTATTGT 
GCATATGGAT 
CAATGCTTAC 
AACCAAATAT 
CTTCGTGCGA 
CCTACCAGAT 
ATTACATCAC 
TTGGCTCAGT 
TCGTATTGTT 
ACTTGATTCA 
GACTCCGATA 
TACTAGTGGA 
CCATTAGCGG 
CAAGCAACCA 
AATCGCTGTC 
TAGGACCTTC 
CCAATGAACT 



ACAATTGAGG 
GGCCTTCCAA 
AGCTAGGAAA 
CTTAACCCAA 
CCTGAAAACG 
TCTTTGTCCC 
GAAAATGGTG 
TTTCCAAGGT 
CTTTTCCTAT 
ATGGTTCAAA 
GTTTGTTGGA 
CCTTTGGCGA 
GAATCTACTG 
CAATTCCTTT 
ATGTAGCAAC 
GAAGGCATTT 
GCAACTGCAA 
TGAGCATCTT 
TTGACAACTG 
AAAAACAGGT 
ATACCAATGC 
GCAGTGGTCT 
CATTGCGCGT 
AGAAAGGAAA 



PCT/CA00/01150 

AAGGTAGTAA 3 00 

GATAGCGTGG 3 50 

TGGTGGAGCT 4 00 

AAACAGGTGC 450 

GGAGAGTTGA 500 

AGGTTCGGTT 550 

TTTTATCAGG 600 

TCAGCTCCAA 650 

TACAGCTGTG 700 

CCGCTCTTGG 750 

ACCAGCAATT 800 

ATATGGCTTG 850 

GATTTGTTCC 900 

GGGCAGTTTG 950 

TATTGCAAAT 1000 

ATGGTAATAA 1050 

CCGACAGAGA 1100 

GC AC CAAGGT 1150 

GACGTGCCTT 1200 

ACAGCCGAAA 1250 

GGTGGCCTAT 1300 

TTCCTCATAA 1350 

GACATTATCA 1400 

TTATGCT 1447 



2) INFORMATION FOR SEQ ID NO: 1031 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 6 bases 

(B) . TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1031 

ATTGCCTCTA TTTCAAAGGA GATGCCTGGC ATTAGTATTT CTACCTCTTG 50 

45 GGATAGAAAG GTTTTGGAAA CTTCTCTTTC TTCTATAGTA GGGAGTGTAT 100 

CCAGTGAAAA AGCTGGTCTC CCAGCGGAAG AAGCAGAAGC CTATCTTAAA 150 

AAAGGCTATT CTCTAAATGA CCGTGTTGGG ACTTCTTACC TGGAAAAACA 200 

ATACGAGGAA ACCTTACAAG GAAAACGCTC GGTAAAAGAA ATCCATCTGG 250 

ATAAATATGG CAATATGGAA AGCGTGGATA CAATTGAGGA AGGTAGTAAG 300 

50 GGAAACAATA TCAAACTGAC CATTGATTTG GCCTTCCAAG ATAGCGTGGA 350 

TGCTTTGCTG AAAAGTTATT TCAATTCCGA GCTAGGAAAT GGTGGAGCTA 4 00 

AGTATTCTGA AGGCGTGTAT GCAGTCGCCC TTAACCCCAA AACAGGTGCT 450 

GTTTTGTCTA TGTCAGGACT CAAACATGAC CTGAAAACGG GAGACTTGAC 500 

GCCTGATTCC TTGGGAACGG TAACCAATGT CTTTGTCCCA GGGTCAGTAG 550 

55 TTAAGGCCGC TACCATCAGC TCAGGTTGGG AAAATGGTGT TTTATCAGGA 600 

AACCAAACCT TAACAGATCA GCCTATTGTT TTCCAAGGTT CAGCTCCAAT 650 

TTATTCTTGG TATAAATTGG CATATGGATC TTTTCCTATT ACAGCTGTGG 700 

AAGCCTTGGA GTATTCATCT AATGCTTACA TGGTTCAAAC CGCTCTTGGA 750 

ATCATGGGCC AGACCTATCA ACCAAATATG TTTGTTGGAA CCAGCAATTT 800 

60 GGAAACAGCT ATGGGAAAAC TTCGTGCGAC CTTTGGCGAA TATGGCTTGG 850 

577 
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GGGCTGCGAC CGGAATTGAC CTACCAGATG AATCTACTGG ATTTGTTCCC 900 

AAAGAGTATA GCTTTGCTAA TTACATTACT AATGCCTTTG GGCAGTTTGA 950 

TAACTATACG CCGATGCAGT TGGCTCAGTA TGTAGCAACT ATTGCAAATG 100 0 

ATGGTGTTCG TGTGGCTCCT CGTATTGTTG AAGGCATTTA TGGTAATAAT 1050 

5 GATAAGGGAG GACTGGGTGA CTTGATTCAG CAACTGCAAC CGACAGAGAT 1100 

GAATAAGGTC AATATATCCG ACTCCGATAT GAGCATCTTG CACCAAGGTT 1150 

TTTATCAGGT TGCCCATGGT ACTAGTGGAT TGACAACTGG ACGTGCCTTT 12 00 

TCAAATGGCG CCTTGGTATC CATTAGCGGA AAAACAGGTA CAGCCGAAAG 12 50 

CTATGTGGCA GATGGTCAGC AAGCAACCAA TACCAATGCG GTGGCCTATG 130 0 

10 CCCCATCTGA TAATCCCCAA ATCGCTGTTG CAGTGGTCTT TCCTCATAAT 1350 

ACCAATCTAA CAAATGGTGT AGGACCTTCC ATTGCGCGTG ACATTATCAA 14 0 0 

TCTGTATCAA AAATACCATC CAATGAACTA GAAAGGAAAT TATGCT 1446 



15 



2) INFORMATION FOR SEQ ID NO: 1032 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1446 bases 
2 0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-14 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1032 

ATTGCCTCTA TTTCAAAGGA GATGCCTGGC ATTAGTATTT CTACTTCTTG 50 
GGATAGAAAG GTTTTGGAAA CTTCCCTTTC TTCTATAGTT GGGAGTGTAT 100 
CCAGTGAAAA AGCTGGTCTC CCAGCGGAAG AAGCAGAAGC CTATCTTAAA 150 

35 AAAGGCTATT CTCTAAATGA TCGTGTTGGA ACCTCCTATT TGGAAAAGCA 2 00 

ATATGAAGAG ACCTTACAAG GGAAACGCTC GGTAAAAGAA ATCCATCTGG 250 
ATAAATATGG CAATATGGAA AGTGTGGATA CAATTGAGGA AGGTAGTAAG 3 00 

GGAAACAATA TCAAGCTGAC CATTGATTTG GCCTTCCAAG ATAGCGTGGA 3 50 

TGCTTTGCTG AAAAGTTATT TCAATTCCGA GCTAGGAAAT GGTGGAGCCA 4 00 

40 AGTATTCTGA GGGTGTGTAT GCAGTCGCCC TTAACCCCAA AACAGGTGCT 450 
GTTTTGTCTA TGTCAGGACT CAAACATGAC CTGAAAACGG GAGAGTTGAC 500 
TCCTGATTCC TTGGGAACGG TAACCAATGT CTTTGTCCCA GGTTCGGTTG 550 
TCAAGGCTGC GACCATCAGC TCTGGCTGGG AAAATGGAGT CTTATCAGGA 600 
AACCAGACCT TGACAGACCA GTCCATTGTC TTTCAAGGTT CAGCTCCCAT 650 

4 5 CAATTCTTGG TATACTCAGG CTTACGGTTC ATTCCCTATC ACAGCAGTCC 700 

AAGCTCTGGA GTATTCATCT AATGCCTATA TGGTCCAAAC AGCTTTAGGT 750 
CTTATGGGGC AGACCTACCA ACCTAATATG TTTGTCGGCA CCAGCAACCT 8 00 

AGAGTCTGCT ATGGGGAAAT TGCGTTCAAC CTTTGGTGAA TATGGTTTGG 850 
GTTCTGCGAC CGGGATTGAC CTACCAGATG AATCTACTGG ATTTGTTCCC 900 

5 0 AAAGACTATA GCTTTGCTAA TTACATCACC AATGCCTTTG GGCAGTTTGA 950 

TAACTATACG CCGATGCAGT TGGCTCAGTA TGTAGCAACT ATTGCAAATG 1000 

ATGGTGTTCG TGTGGCTCCT CGTATTGTTG AAGGCATTTA TGGAAATAAT 1050 

GATAAGGGAG GCCTAGGCGA CTTGATTCAG CAACTGCAAC CGACAGAGAT 1100 

GAATAAGGTC AATATATCCG ACTCTGATAT GAGTATTTTG CACCAAGGTT 1150 

55 TTTATCAGGT TGCTCATGGG ACTAGTGGAT TGACAACTGG ACGTGCCTTT 12 00 

TCAAATGGCG CAGCGGTATC CATTAGTGGA AAAACAGGTA CTGCCGAAAG 12 50 

TTATGTTGAG GGTGGTCAAG AAGCTAACAA TACTAATGCT GTGGCCTATG 13 00 

CACCATCAGA TAATCCTCAA ATCGCTGTAG CTGTTGTCTT CCCTCATAAC 13 50 

ACCAACCTTA CAAATGGTGT CGGACCTTCC ATTGCGCGCG ATATTATCAA 14 00 

60 CCTCTATAAC CAACATCATC CAATGAATTA GAAAGGAACA TATGCT 1446 
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2) INFORMATION FOR SEQ ID NO: 1033 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1446 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 3 

TATTGCCTCT ATTTCAAAGG AGATGCCTGG CATTAGTATT TCTACTTCTT 5 0 

GGGATAGAAA GGTTTTGGAA ACTTCCCTTT CTTCTATAGT TGGGAGTGTA 100 

TCCAGTGAAA AAGCTGGTCT CCCAGCGGAA GAAGCAGAAG CCTATCTTAA 150 

AAAAGGCTAT TCTCTAAATG ACCGTGTTGG AACCTCGTAT TTGGAAAAGC 2 00 

AATATGAAGA GACCTTACAA GGAAAACGCT CGGTAAAAGA AATCCATCTG 2 50 

GATAAATATG GCAATATGGA AAGCGTGGAT ACAATTGAGG AAGGTAGTAA 3 00 

GGGAAACAAT ATCAAGCTGA CCATTGATTT GGCCTTCCAA GATAGCGTGG 3 50 

ATGCTTTGCT GAAAAGTTAT TTCAATTCCG AGCTAGGAAA TGGTGGAGCC 400 

AAGTATTCTG AGGGTGTGTA TGCAGTCGCC CTTAACCCCA AAACAGGTGC 450 

TGTTTTGTCT ATGTCAGGAC TCAAACATGA CCTGAAAACG GGAGAGTTGA 500 

CTCCTGATTC CTTGGGAACG GTAACCAATG TCTTTGTCCC AGGTTCGGTT 550 

GTTAAGGCCG CTACCATCAG CTCAGGTTGG GAAAATGGTG TTTTATCAGG 600 

AAACCAAACC TTAACAGATC AGCCTATTGT TTTCCAAGGT TCAGCTCCAA 650 

TTTATTCTTG GTATAAATTG GCATATGGAT CTTTTCCTAT TACAGCTGTG 700 
GAAGCCTTGG AGTATTCATC CAATGCTTAC ATGGTTCAAA CCGCTCTTGG 750 
AATCATGGGC CAGACCTATC AACCAAATAT GTTTGTTGGA ACCAGCAATT 800 
TGGAAACAGC TATGGGAAAA CTTCGTGCGA CCTTTGGCGA ATATGGCTTG 850 
GGGGCTGCGA CCGGAATTGA CCTACCAGAT GAATCTACTG GATTTGTTCC 900 
CAAAGAGTAT AGCTTTGCTA ATTACATCAC CAATGCCTTT GGGCAGTTTG 950 

ATAACTATAC GCCCATGCAG TTGGCTCAGT ATGTAGCAAC TATTGCAAAT 1000 

GATGGTGTTC GTGTGGCTCC TCGTATTGTT GAAGGCATTT ATGGTAATAA 1050 

TGATAAGGGA GGACTGGGTG ACTTGATTCA GCAACTGCAA CCGACAGAGA 1100 

TGAATAAGGT CAATATATCC GACTCCGATA TGAGTATCTT GCACCAAGGT 1150 

TTTTATCAGG TTGCTCATGG GACTAGTGGA TTGACAACTG GACGTGCTTT 1200 

TTCAAATGGT GCCTTGGTAT CCATTAGTGG GAAAACAGGT ACTGCCGAAA 1250 

GTTATGTTGC AGGTGGTCAA GAAGCCAACA ATACCAACGC GGTGGCCTAT 13 00 

GCCCCATCAG ATAATCCTCA AATCGCTGTT GCCGTTGTCT TCCCTCATAA 13 50 

CACCAATCTA ACAAATGGTG TTGGACCTTC TATTGCACGC GATATTATCA 14 00 

ACCTCTATAA CCAACACCAT CCAATGAATT AGAAAGGAAC TTATGC 1446 



2) INFORMATION FOR SEQ ID NO: 1034 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1670 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-01 



5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1034 

GGAAACTGCA GAGGTCAAGG GGATTGATTT TACAACCAGT CCCAATCGTA 
GTTATCCAAA CGGACAATTT GCTTCTAGTT TTATCGGACT AGCTCAGCTC 

10 CATGAAAATG AAGATGGCAG CAAGAGCTTA CTGGGAACCT CTGGAATGGA 
GAGTTCCTTG AACAGTATTC TTGCAGGGAC AGACGGTATT ATTACCTATG 
AAAAAGACCG TGTAGGAAAT ATCGTACCAG GTACAGAACT GGTATCGCAA 
CAAACTGTGG ATGGCAAGGA TGTTTATACA ACATTGTCTA GTCCGCTACA 
ATCTTTCATG GAAACTCAGA TGGATGCCTT TCTAGAAAAA GTAAAAGGTA 

15 AGTATATGAC CGCGACCTTG GTCAGTGCAA AGACCGGTGA AATCCTCGCT 
ACCACCCAAC GACCTACCTT TAATGCAGAT ACTAAAGAAG GAATCACTGA 
GGACTTTGTT TGGCGTGATA TTCTTTATCA AAGTAACTAT GAACCAGGAT 
CAGCCATGAA GGTTATGACG TTAGCTTCTT CTATTGATAA TAATACCTTC 
CCAAGTGGAG AATACTTCAA TAGCAGTGAA TTCAAAATAG CGGATGCGAC 

2 0 GACTCGAGAT TGGGATGTTA ATGATGGTTT GACTACTGGT GGGATGATGA 
CTTTCTTACA AGGTTTCGCT CACTCCAGTA ATGTTGGAAT GAGTCTACTT 
GAACAAAAAA TGGGAGATGC TACTTGGTTG GATTATCTAA AACGCTTTAA 
ATTTGGGGTT CCAACTCGCT TTGGCTTGAC AGATGAATAC GCTGGTCAAC 
TTCCAGCTGA TAATATTGTT AGTATTGCTC AAAGCTCATT TGGGCAAGGA 

2 5 . ATTTCAGTGA CACAAACACA AATGCTTCGT GCCTTTACAG CTATTGCTAA 

TGATGGAGTT ATGCTGGAGC CAAAATTTAT AAGTGCTATT TATGATACTA 
ACAATCAGTC TGTACGTAAG TCACAAAAAG AAATAGTAGG AAATCCTGTT 
TCCAAAGAGG CAGCAAGCAC AACTCGAAAT CACATGATCT TAGTTGGGAC 
GGACCCTCTA TATGGAACTA TGTATAATCA CTACACAGGA AAGCCAATTA 

3 0 TAACAGTTCC TGGACAAAAT GTAGCAGTTA AATCCGGTAC GGCTCAAATC 

GCTGATGAGA AAAATGGAGG ATACTTGGTT GGTTCTACCA ATTATATTTT 
CTCAGTTGTG ACTATGAATC CTGCTGAAAA TCCTGATTTT ATCTTGTATG 
TAACGGTTCA ACAGCCTGAG CATTATTCAG GTATCCAGTT GGGAGAATTT 
GCCACCCCAA TCTTGGAGCG GGCTTCAGCT ATGAAAGAAT CTCTCAATCT 
35 TCAATCTCCA GCCAAAAATT TAGATAAAGT TACGACAGAA TCTTCTTATG 
CAATGCCTAG CATCAAGGAT ATTTCACCTG GTGAGTTGGC GGAAGCCTTA 
CGCCGAAATA TTGTGCAACC AATCGTTGTA GGTACTGGAA CAAAGATTAA 
AGAGACTTCT GTAGAAGAAG GGACCAATCT TGCACCAAAC CAACAAGTTC 
TCCTTTTATC GGATAAGGTA GAAGAAATTC CAGACATGTA TGGCTGGAAA 

4 0 AAAGAGACTG CCGAGACCTT TGCTAAATGG TTGGATATTG AACTGGAATT 

TGAAGGTTCA GGTTCCGTTG 



45 2) INFORMATION FOR SEQ ID NO: 1035 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1670 
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(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1683 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: Genomic DNA 



55 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-02 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 103 5 



60 



580 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

AAAGAATTGG AAACTGCAGA GGTCAAGGGG ATTGATTTTA CAACCAGTCC 50 

CAATCGTAGT TACCCAAATG GACAATTTGC TTCTAGTTTT ATCGGTCTAG 10 0 

CTCAGCTCCA TGAAAATGAA GATGGAAGCA AGAGTTTGCT GGGAACCTCT 150 

GGAATGGAGA GTTCCTTGAA CAGTATTCTT GCAGGGACAG ACGGCATTAT 2 00 

5 TACCTATGAA AAGGATCGTC TGGGCAATAT TGTACCCGGA ACAGAACAAG 2 50 

TTTCCCAACA AACGGTGGAT GGCAAGGATG TTTACACAAC CATTTCCAGC 3 00 

CCCCTCCAGT CCTTCATGGA AACTCAGATG GATGCCTTTC TAGAAAAAGT 3 50 

AAAAGGTAAG TATATGACCG CGACCTTGGT CAGTGCAAAG ACCGGTGAAA 4 00 

TCCTCGCTAC CACCCAACGA CCTACCTTTA ATGCAGATAC TAAAGAAGGA 4 50 

10 ATCACTGAGG ACTTTGTTTG GCGTGATATT CTTTATCAAA GTAACTATGA 500 

ACCAGGATCA GCCATGAAGG TTATGACGTT AGCTTCTTCT ATTGATAATA 550 

ATACCTTCCC AAGTGGAGAA TACTTCAATA GCAGTGAATT CAAAATAGCG 600 

GATGCGACGA CTCGAGATTG GGATGTTAAT GATGGTTTGA CTACTGGTGG 650 

GATGATGACT TTCTTACAAG GTTTCGCTCA CTCCAGTAAT GTTGGAATGA 700 

15 GTCTACTTGA ACAAAAAATG GGAGATGCTA CTTGGTTGGA TTATCTAAAA 750 

CGCTTTAAAT TTGGGGTTCC AACTCGCTTT GGCTTGACAG ATGAATACGC 8 00 

TGGTCAACTT CCAGCTGATA ATATTGTTAG TATTGCTCAA AGCTCATTTG 8 50 

GGCAAGGAAT TTCAGTGACA CAAACACAAA TGCTTCGTGC CTTTACAGCT 900 

ATTGCTAATG ATGGAGTTAT GCTGGAGCCA AAATTTATAA GTGCTATTTA 950 

2 0 TGATACTAAC AATCAGTCTG TACGTAAGTC ACAAAAAGAA ATAGTAGGAA 1000 

ATCCTGTTTC CAAAGAGGCA GCAAGCACAA CTCGAAATCA CATGATCTTA 1050 

GTTGGGACGG ACCCTCTATA TGGAACTATG TATAATCACT ACACAGGAAA 1100 

GCCAATTATA ACAGTTCCTG GACAAAATGT AGCAGTTAAA TCCGGTACGG 1150 

CTCAAATCGC TGATGAGAAA AATGGAGGAT ACTTGGTTGG TTCTACCAAT 12 0 0 

25 TATATTTTCT CAGTTGTGAC TATGAATCCT GCTGAAAATC CTGATTTTAT 12 50 

CTTGTATGTA ACGGTTCAAC AGCCTGAGCA TTATTCAGGT ATCCAGTTGG 13 00 

GAGAATTTGC CACCCCAATC TTGGAGCGGG CTTCAGCTAT GAAAGAATCT 13 50 

CTCAATCTTC AATCTCCAGC CAAAAATTTA GATAAAGTTA CGACAGAATC 14 00 

TTCTTATGCA ATGCCTAGCA TCAAGGATAT TTCACCTGGT GAGTTGGCGG 14 50 

3 0 AAGCCTTACG CCGAAATATT GTGCAACCAA TCGTTGTAGG TACTGGAACA 1500 

AAGATTAAAG AGACTTCTGT AGAAGAAGGG ACCAATCTTG CACCAAACCA 1550 

ACAAGTTCTC CTTTTATCGG ATAAGGTAGA AGAAATTCCA GACATGTATG 160 0 

GCTGGAAAAA AGAGACTGCC GAGACCTTTG CTAAATGGTT GGATATTGAA 1650 

CTGGAATTTG AAGGTTCAGG TTCCGTTGTT CAG 1683 
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2) INFORMATION FOR SEQ ID NO: 1036 



40 



45 



50 



( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 1682 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-03 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1036 



55 



60 



TCAAAAAAGA ATTGGAAACT GCAGAGGTCA AGGGGATTGA 
AGTCCCAATC GTAGTTACCC AAATGGACAA TTTGCTTCTA 
CTCCATGAAA ATGAAGATGG AAGCAAGAGT 
GGAGAGTTCC TTGAACAGTA TTCTTGCAGG 
ATGAAAAGGA TCGTCTGGGC AATATTGTAC 
CAACAAACGG TGGATGGCAA GGATGTTTAC 
CCAGTCCTTC ATGGAAACTC AGATGGATGC 

581 



TCTAGCTCAG 
CCTCTGGAAT 
ATTATTACCT 
ACAAGTTTCC 
CCAGCCCCCT 



TTTTACAACC 50 

GTTTTATCGG 10 0 

TTGCTGGGAA 150 

GACAGACGGC 200 

CCGGAACAGA 250 

ACAACCATTT 300 

CTTTCTAGAA 350 
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AAAGTAAAAG 
TGAAATCCTC 
AAGGAATCAC 
TATGAACCAG 
5 TAATAATACC 
TAGCGGATGC 
GGTGGGATGA 
AATGAGTCTA 
TAAAACGCTT 

10 TACGCTGGTC 
ATTTGGGCAA 
CAGCTATTGC 
ATTTATGATA 
AGGAAATCCT 

15 TCTTAGTTGG 
GGAAAGCCAA 
TACGGCTCAA 
CCAATTATAT 
TTTATCTTGT 

2 0 GTTGGGAGAA 
AATCTCTCAA 
GAATCTTCTT 
GGCGGAAGCC 
GAACAAAGAT 

2 5 AACCAACAAG 
GTATGGCTGG 
TTGAACTGGA 



GTAAGTATAT 
GCTACCACCC 
TGAGGACTTT 
GATCAGCCAT 
TTCCCAAGTG 
GACGACTCGA 
TGACTTTCTT 
CTTGAACAAA 
TAAATTTGGG 
AACTTCCAGC 
GGAATTTCAG 
TAATGATGGA 
CTAACAATCA 
GTTTCCAAAG 
GACGGACCCT 
TTATAACAGT 
ATCGCTGATG 
TTTCTCAGTT 
ATGTAACGGT 
TTTGCCACCC 
TCTTCAATCT 
ATGCAATGCC 
TTACGCCGAA 
TAAAGAGACT 
TTCTCCTTTT 
AAAAAAGAGA 
ATTTGAAGGT 



GACCGCGACC 
AACGACCTAC 
GTTTGGCGTG 
GAAGGTTATG 
GAGAATACTT 
GATTGGGATG 
ACAAGGTTTC 
AAATGGGAGA 
GTTCCAACTC 
TGATAATATT 
TGACACAAAC 
GTTATGCTGG 
GTCTGTACGT 
AGGCAGCAAG 
CTATATGGAA 
TCCTGGACAA 
AGAAAAATGG 
GTGACTATGA 
TCAACAGCCT 
CAATCTTGGA 
CCAGCCAAAA 
TAGCATCAAG 
ATATTGTGCA 
TCTGTAGAAG 
ATCGGATAAG 
CTGCCGAGAC 
TCAGGTTCCG 



TTGGTCAGTG 
CTTTAATGCA 
ATATTCTTTA 
ACGTTAGCTT 
CAATAGCAGT 
TTAATGATGG 
GCTCACTCCA 
TGCTACTTGG 
GCTTTGGCTT 
GTTAGTATTG 
ACAAATGCTT 
AGCCAAAATT 
AAGTCACAAA 
CACAACTCGA 
CTATGTATAA 
AATGTAGCAG 
AGGATACTTG 
ATCCTGCTGA 
GAGCATTATT 
GCGGGCTTCA 
ATTTAGATAA 
GATATTTCAC 
ACCAATCGTT 
AAGGGACCAA 
GTAGAAGAAA 
CTTTGCTAAA 
TT 



PCT/CA00/01150 

CAAAGACCGG 400 

GATACTAAAG 450 

TCAAAGTAAC 500 

CTTCTATTGA 550 

GAATTCAAAA 600 

TTTGACTACT 650 

GTAATGTTGG 700 

TTGGATTATC 750 

GACAGATGAA 800 

CTCAAAGCTC 850 

CGTGCCTTTA 900 

TATAAGTGCT 950 

AAGAAATAGT 1000 

AATCACATGA 1050 

TCACTACACA 110 0 

TTAAATCCGG 1150 

GTTGGTTCTA 1200 

AAATCCTGAT 12 50 

CAGGTATCCA 13 00 

GCTATGAAAG 1350 

AGTTACGACA 1400 

CTGGTGAGTT 14 50 

GTAGGTACTG 1500 

TCTTGCACCA 1550 

TTCCAGACAT 1600 

TGGTTGGATA 1650 
1682 



30 



40 



2) INFORMATION FOR SEQ ID NO: 1037 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1681 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-04 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 7 

CAAAAAAGAA TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 

GTCCCAATCG TAGTTATCCA AACGGACAAT TTGCTTCTAG TTTTATCGGA 100 

CTAGCTCAGC TCCATGAAAA TGAAGATGGC AGCAAGAGCT TACTGGGAAC 150 

50 CTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGTA 2 00 

TTATTACCTA TGAAAAAGAC CGTGTAGGAA ATATCGTACC AGGTACAGAA 250 

CTGGTATCGC AACAAACTGT GGATGGCAAG GATGTTTATA CAACATTGTC 3 00 

TAGTCCGCTA CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 4 00 

55 GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 4 50 

AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 5 00 

ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG 650 

60 GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 
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ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 

5 AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

10 ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 1200 

CAATTATATT TTCTCAGTTG TGACTATGAA TCCTGCTGAA AATCCTGATT 1250 

TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 13 00 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 1350 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 14 00 

15 AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG 1450 

GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

2 0 TGAACTGGAA TTTGAAGGTT CAGGTTCCGT T 1681 



25 



2) INFORMATION FOR SEQ ID NO: 1038 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
30 (D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103 8 

4 0 AAAAAAGAAT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

TCCCAATCGT AGTTATCCAA ACGGACAATT TGCTTCTAGT TTTATCGGAC 100 

TAGCTCAGCT CCATGAAAAT GAAGATGGCA GCAAGAGCTT ACTGGGAACC 150 

TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA CAGACGGTAT 2 00 

TATTACCTAT GAAAAAGACC GTGTAGGAAA TATCGTACCA GGTACAGAAC 2 50 

4 5 TGGTATCGCA ACAAACTGTG GATGGCAAGG ATGTTTATAC AACATTGTCT 3 00 

AGTCCGCTAC AATCTTTCAT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 3 50 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 40 0 

AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 450 

GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 500 

50 TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTATTGATA 550 

ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 600 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 650 

TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 70 0 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 750 

55 AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 800 

CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 850 

TTGGGCAAGG AATTTCAGTG ACACAAACAC AAATGCTTCG TGCCTTTACA 900 

GCTATTGCTA ATGATGGAGT TATGCTGGAG CCAAAATTTA TAAGTGCTAT 950 

TTATGATACT AACAATCAGT CTGTACGTAA GTCACAAAAA GAAATAGTAG 1000 

60 GAAATCCTGT TTCCAAAGAG GCAGCAAGCA CAACTCGAAA TCACATGATC 1050 
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TTAGTTGGGA CGGACCCTCT 
AAAGCCAATT ATAACAGTTC 
CGGCTCAAAT CGCTGATGAG 
AATTATATTT TCTCAGTTGT 
5 TATCTTGTAT GTAACGGTTC 
TGGGAGAATT TGCCACCCCA 
TCTCTCAATC TTCAATCTCC 
ATCTTCTTAT GCAATGCCTA 
CGGAAGCCTT ACGCCGAAAT 
10 ACAAAGATTA AAGAGACTTC 
CCAACAAGTT CTCCTTTTAT 
ATGGCTGGAA AAAAGAGACT 
GAACTGGAAT TTGAAGGTTC 



ATATGGAACT ATGTATAATC 
CTGGACAAAA TGTAGCAGTT 
AAAAATGGAG GATACTTGGT 
GACTATGAAT CCTGCTGAAA 
AACAGCCTGA GCATTATTCA 
ATCTTGGAGC GGGCTTCAGC 
AGCCAAAAAT TTAGATAAAG 
GCATCAAGGA TATTTCACCT 
ATTGTGCAAC CAATCGTTGT 
TGTAGAAGAA GGGACCAATC 
CGGATAAGGT AGAAGAAATT 
GCCGAGACCT TTGCTAAATG 
AGGTTCCGTT GTTCA 



PCT/CA00/01150 

ACTACACAGG 110 0 

AAATCCGGTA 1150 

TGGTTCTACC 1200 

ATCCTGATTT 1250 

GGTATCCAGT 13 00 

TATGAAAGAA 13 50 

TTACGACAGA 1400 

GGTGAGTTGG 1450 

AGGTACTGGA 1500 

TTGCACCAAA 1550 

CCAGACATGT 16 00 

GTTGGATATT 1650 
1685 



15 



2 ) INFORMATION FOR SEQ ID NO: 1039 



(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 1679 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1039 



AAAAGAATTG 
CCAATCGTAG 

3 5 GCTCAGCTCC 

TGGAATGGAG 
TTACCTATGA 
GTATCGCAAC 
TCCGCTACAA 

4 0 TAAAAGGTAA 

ATCCTCGCTA 
AATCACTGAG 
AACCAGGATC 
AATACCTTCC 

4 5 GGATGCGACG 
GGATGATGAC 
AGTCTACTTG 
ACGCTTTAAA 
CTGGTCAACT 

50 GGGCAAGGAA 
TATTGCTAAT 
ATGATACTAA 
AATCCTGTTT 
AGTTGGGACG 

55 AGCCAATTAT 
GCTCAAATCG 
TTATATTTTC 
TCTTGTATGT 
GGAGAATTTG 

60 TCTCAATCTT 



GAAACTGCAG 
TTATCCAAAC 
ATGAAAATGA 
AGTTCCTTGA 
AAAAGACCGT 
AAAC TGTGGA 
TCTTTCATGG 
GTATATGACC 
CCACCCAACG 
GACTTTGTTT 
AGCCATGAAG 
CAAGTGGAGA 
ACTCGAGATT 
TTTCTTACAA 
AACAAAAAAT 
TTTGGGGTTC 
TCCAGCTGAT 
TTTCAGTGAC 
GATGGAGTTA 
CAATCAGTCT 
CCAAAGAGGC 
GACCCTCTAT 
AACAGTTCCT 
CTGATGAGAA 
TCAGTTGTGA 
AACGGTTCAA 
CCACCCCAAT 
CAATCTCCAG 



AGGTCAAGGG 
GGACAATTTG 
AGATGGCAGC 
ACAGTATTCT 
GTAGGAAATA 
TGGCAAGGAT 
AAACTCAGAT 
GCGACCTTGG 
ACCTACCTTT 
GGCGTGATAT 
GTTATGACGT 
ATACTTCAAT 
GGGATGTTAA 
GGTTTCGCTC 
GGGAGATGCT 
CAACTCGCTT 
AATATTGTTA 
ACAAACACAA 
TGCTGGAGCC 
GTACGTAAGT 
AGCAAGCACA 
ATGGAACTAT 
GGACAAAATG 
AAATGGAGGA 
CTATGAATCC 
CAGCCTGAGC 
CTTGGAGCGG 
CCAAAAATTT 



GATTGATTTT 
CTTCTAGTTT 
AAGAGCTTAC 
TGCAGGGACA 
TCGTACCAGG 
GTTTATACAA 
GGATGCCTTT 
TCAGTGCAAA 
AATGCAGATA 
TCTTTATCAA 
TAGCTTCTTC 
AGCAGTGAAT 
TGATGGTTTG 
ACTCCAGTAA 
ACTTGGTTGG 
TGGCTTGACA 
GTATTGCTCA 
ATGCTTCGTG 
AAAATTTATA 
CACAAAAAGA 
ACTCGAAATC 
GTATAATCAC 
TAGCAGTTAA 
TACTTGGTTG 
TGCTGAAAAT 
ATTATTCAGG 
GCTTCAGCTA 
AGATAAAGTT 



ACAACCAGTC 50 

TATCGGACTA 100 

TGGGAACCTC 150 

GACGGTATTA 2 00 

TACAGAACTG 2 50 

CATTGTCTAG 300 

CTAGAAAAAG 3 50 

GACCGGTGAA 4 00 

CTAAAGAAGG 4 50 

AGTAACTATG 500 

TATTGATAAT 550 

TCAAAATAGC 600 

ACTACTGGTG 650 

TGTTGGAATG 700 

ATTATCTAAA 750 

GATGAATACG 800 

AAGCTCATTT 850 

CCTTTACAGC 900 

AGTGCTATTT 950 

AATAGTAGGA 1000 

ACATGATCTT 1050 

TACACAGGAA 1100 

ATCCGGTACG 1150 

GTTCTACCAA 12 00 

CCTGATTTTA 1250 

TATCCAGTTG 13 00 

TGAAAGAATC 1350 

ACGACAGAAT 1400 
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CTTCTTATGC AATGCCTAGC ATCAAGGATA 
GAAGCCTTAC GCCGAAATAT TGTGCAACCA 
AAAGATTAAA GAGACTTCTG TAGAAGAAGG 
AACAAGTTCT CCTTTTATCG GATAAGGTAG 
5 GGCTGGAAAA AAGAGACTGC CGAGACCTTT 
ACTGGAATTT GAAGGTTCAG GTTCCGTTG 



PCT/CA00/01150 

TTTCACCTGG TGAGTTGGCG 1450 
ATCGTTGTAG GTACTGGAAC 1500 
GACCAATCTT GCACCAAACC 1550 
AAGAAATTCC AGACATGTAT 1600 
GCTAAATGGT TGGATATTGA 1650 

1679 



10 2) INFORMATION FOR SEQ ID NO: 1040 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1685 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-07 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1040 



TAAAAAGGAT CTAAAAGACG CTAGTGTTGA AGGAATTGAC TTCACAACTA 50 

GCCCTAATAG AAGCTATCCA AATGGACAAT TCGCTTCTAG TTTTATTGGT 100 

TTGGCCCAAC TCCATGAAAA TGAGGATGGT AGCAAGAGTT TGCTGGGAAC 150 

TTCTGGGATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG AAAGACGGTA 2 00 

30 TTATTACCTA TGAAAAAGAT CGTCTGGGTA ATATTGTCCC TGGAACAGAA 2 50 

CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTATA CAACATTGTC 3 00 

TAGTCCGCTA CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 3 50 

AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 4 00 

GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 4 50 

35 AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 50 0 

ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 

AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 

AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGAAGGT TTGACTACTG 650 

GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 

40 ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 

ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 

TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 

AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

45 TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 12 00 

50 CAATTATATT TTCTCAGCTG TGACTATGAA TCCTGCTGAA AATCCTGATT 12 50 

TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 13 00 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 13 50 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 14 00 

AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG 14 50 

55 GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 150 0 

AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

TGAACTGGAA TTTGAAGGTT CAGGTTCCGT CGTTC 1685 



60 
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2) INFORMATION FOR SEQ ID NO: 1041 



5 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1696 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



10 



(ii) 



MOLECULE TYPE: Genomic DNA 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-08 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1041 

GTCTATCAAA AAAGAATTGG AAACTGCAGA GGTCAAGGGG ATTGATTTTA 
2 0 CAACCAGTCC CAATCGTAGT TATCCAAACG GACAATTTGC TTCTAGTTTT 
ATCGGACTAG CTCAGCTCCA TGAAAATGAA GATGGCAGCA AGAGCTTACT 
GGGAACCTCT GGAATGGAGA GTTCCTTGAA CAGTATTCTT . GCAGGGACAG 
ACGGTATTAT TACCTATGAA AAAGACCGTG TAGGAAATAT CGTACCAGGT 
ACAGAACTGG TATCGCAACA AACTGTGGAT GGCAAGGATG TTTATACAAC 

2 5 ATTGTCTAGT CCGCTACAAT CTTTCATGGA AACTCAGATG GATGCCTTTC 

TAGAAAAAGT AAAAGGTAAG TATATGACCG CGACCTTGGT CAGTGCAAAG 
ACCGGTGAAA TCCTCGCTAC CACCCAACGA CCTACCTTTA ATGCAGATAC 
TAAAGAAGGA ATCACTGAGG ACTTTGTTTG GCGTGATATT CTTTATCAAA 
GTAACTATGA ACCAGGATCA GCCATGAAGG TTATGACGTT AGCTTCTTCT 

3 0 ATTGATAATA ATACCTTCCC AAGTGGAGAA TACTTCAATA GCAGTGAATT 

CAAAATAGCG GATGCGACGA CTCGAGATTG GGATGTTAAT GATGGTTTGA 

CTACTGGTGG GATGATGACT TTCTTACAAG GTTTCGCTCA CTCCAGTAAT 

GTTGGAATGA GTCTACTTGA ACAAAAAATG GGAGATGCTA CTTGGTTGGA 

TTATCTAAAA CGCTTTAAAT TTGGGGTTCC AACTCGCTTT GGCTTGACAG 

3 5 ATGAATACGC TGGTCAACTT C CAGCTGAT A ATATTGTTAG TATTGCTCAA 

AGCTCATTTG GGCAAGGAAT TTCAGTGACA CAAACACAAA TGCTTCGTGC 
CTTTACAGCT ATTGCTAATG ATGGAGTTAT GCTGGAGCCA AAATTTATAA 
GTGCTATTTA TGATACTAAC AATCAGTCTG TACGTAAGTC ACAAAAAGAA 
ATAGTAGGAA ATCCTGTTTC .CAAAGAGGCA GCAAGCACAA CTCGAAATCA 

4 0 CATGATCTTA GTTGGGACGG ACCCTCTATA TGGAACTATG TATAATCACT 

ACACAGGAAA GCCAATTATA ACAGTTCCTG GACAAAATGT AGCAGTTAAA 
TCCGGTACGG CTCAAATCGC TGATGAGAAA AATGGAGGAT ACTTGGTTGG 
TTCTACCAAT TATATTTTCT CAGTTGTGAC TATGAATCCT GCTGAAAATC 
CTGATTTTAT CTTGTATGTA ACGGTTCAAC AGCCTGAGCA TTATTCAGGT 

4 5 ATC CAGTTGG GAGAATTTGC CACCCCAATC TTGGAGCGGG CTTCAGCTAT 
GAAAGAATCT CTCAATCTTC AATCTCCAGC CAAAAATTTA GATAAAGTTA 
CGACAGAATC TTCTTATGCA ATGCCTAGCA TCAAGGATAT TTCACCTGGT 
GAGTTGGCGG AAGCCTTACG CCGAAATATT GTGCAACCAA TCGTTGTAGG 
TACTGGAACA AAGATTAAAG AGACTTCTGT AGAAGAAGGG ACCAATCTTG 

50 CACCAAACCA ACAAGTTCTC CTTTTATCGG ATAAGGTAGA AGAAATTCCA 
GACATGTATG GCTGGAAAAA AGAGACTGCC GAGACCTTTG CTAAATGGTT 
GGATATTGAA CTGGAATTTG AAGGTTCAGG TTCCGTTGTT CAGAAG 



55 

2) INFORMATION FOR SEQ ID NO: 1042 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1600 
1650 
1696 



60 



. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1689 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-09 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1042 

AAAAAAGAGT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

TCCCAATCGT AGTTACCCAA ACGGACAATT TGCTTCTAGT TTTATCGGAC 100 

TAGCTCAGCT CCATGAAAAT GAAGATGGCA GCAAGAGCTT GCTGGGAACC 150 

15 TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA CAGACGGCAT 2 00 

TATTACCTAT GAAAAGGATC GTCTGGGAAA TATTGTCCCC GGAACGGAAC 250 

AAGTTTCCCA ACAAACTGTA GATGGCAAGG ATGTTTATAC GACTATTTCC 300 

AGCACCCTTC AGTCCTTCAT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 3 50 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 4 00 

2 0 AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 4 50 

GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 500 

TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTATTGATA 550 

ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 600 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 650 

2 5 TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 700 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 750 

AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 800 

CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 8 50 

TTGGGCAAGG AATTTCAGTG ACACAAACAC AAATGCTTCG TGCCTTTACA 900 

3 0 GCTATTGCTA ATGATGGAGT TATGCTGGAG CCAAAATTTA TAAGTGCTAT 950 

TTATGATACT AACAATCAGT CTGTACGTAA GTCACAAAAA GAAATAGTAG 100 0 

GAAATCCTGT TTCCAAAGAG GCAGCAAGCA CAACTCGAAA TCACATGATC 1050 

TTAGTTGGGA CGGACCCTCT ATATGGAACT ATGTATAATC ACTACACAGG 1100 

AAAGCCAATT ATAACAGTTC CTGGACAAAA TGTAGCAGTT AAATCCGGTA 1150 

3 5 CGGCTCAAAT CGCTGATGAG AAAAATGGAG GATACTTGGT TGGTTCTACC 12 00 

AATTATATTT TCTCAGTTGT GACTATGAAT CCTGCTGAAA ATCCTGATTT 12 50 

TATCTTGTAT GTAACGGTTC AACAGCCTGA GCATTATTCA GGTATCCAGT 13 00 

TGGGAGAATT TGCCACCCCA ATCTTGGAGC GGGCTTCAGC TATGAAAGAC 13 50 

TCTCTCAATC TTCAAACAAC AGCTAAAGCT TTGGAGCAAG TAAGTCAACA 14 00 

4 0 AAGTCCTTAT CCTATGCCTA GTGTCAAGGA TATTTCACCT GGTGATTTAG 14 50 

CAGAAGAATT GCGTCGCAAT CTTGTACAAC CCATCGTTGT GGGAACAGGA 1500 

ACGAAGATTA AAAACAGTTC TGCTGAAGAA GGGAAGAATC TTGCCCCGAA 1550 

TCAGCAAGTC CTTATCTTAT CTGATAAAGT AGAGGAAGTT CCAGATATGT 1600 

ATGGTTGGAC AAAGGAGACT GCTGAGACCC TTGCTAAGTG GCTCAATATA 1650 

4 5 GAACTTGAAT TTCAAGGTTC GGGTTCTACT GTGCAGAAG 1689 



50 



2) INFORMATION FOR SEQ ID NO: 1043 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 90 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Streptococcus pneumoniae 

587 
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(B) STRAIN: StrR-10 



PCT/CAOO/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1043 

5 CAAAAAAGAG TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA 50 
GTCCTAATCG TAGTTACCCA AACGGACAAT TTGCTTCTAG TTTTATCGGT 100 
CTAGCTCAAC TCCATGAAAA TGAAGATGGA AGCAAGAGTT TGCTGGGAAC 150 
TTCTGGAATG GAGAGTTCCT TGAACAGTAT TCTTGCAGGG AAAGACGGTA 2 00 

TTATTACTTA TGAAAAGGAT CGTCTGGGTA ATATTGTCCC TGGAACAGAA 250 

10 CAAGTTTCCC AACAAACGGT AGATGGCAAG GATGTTTATA CAACATTGTC 3 00 

TAGTCCGCTC CAATCTTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA 350 
AAGTAAAAGG TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT 400 
GAAATCCTCG CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA 450 
AGGAATCACT GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT 500 

15 ATGAACCAGG ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT 550 
AATAATACCT TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT 600 
AGCGGATGCG ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG 65 0 

GTGGGATGAT GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA 700 
ATGAGTCTAC TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT 750 

2 0 AAAACGCTTT AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT 800 
ACGCTGGTCA ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA 850 
TTTGGGCAAG GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC 900 
AGCTATTGCT AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA 950 

TTTATGATAC TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA 1000 

2 5 GGAAATCCTG TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT 1050 

CTTAGTTGGG ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG 1100 

GAAAGCCAAT TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT 1150 

ACGGCTCAAA TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC 12 00 

CAATTATATT TTCTCAGTTG TGACTATGAA TCCTGCTGAA AATCCTGATT 1250 

3 0 TTATCTTGTA TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG 13 0 0 

TTGGGAGAAT TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA 13 50 

ATCTCTCAAT CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG 14 00 

AATCTTCTTA TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG 1450 

GCGGAAGCCT TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG 1500 

3 5 AACAAAGATT AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA 1550 

ACCAACAAGT TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG 1600 

TATGGCTGGA AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT 1650 

TGAACTGGAA TTTGAAGGTT CAGGTTCCGT TGTTCAGAAG 16 90 



40 



55 



2) INFORMATION FOR SEQ ID NO: 1044 



(i) SEQUENCE CHARACTERISTICS: 
4 5 (A) LENGTH: 1668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1044 



AGTTGGAAAC TGCAGAGGTC AAGGGGATTG ATTTTACAAC CAGTC CTAAT 50 

CGTAGTTACC CAAACGGACA ATTTGCTTCT AGTTTTATCG GACTAGCTCA 100 

60 GCTCCATGAA AATGAAGATG GCAGCAAGAG CTTGCTGGGA ACCTCTGGGA 150 
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TGGAGAGTTC TTTAAATAGA ATTCTTGCAG GGACAGACGG CATTATTACC 2 00 

TATGAAAAGG ATCGTCTGGG AAATATTGTC CCCGGAACGG AACAAGTTTC 2 50 

CCAACAAACG GTAGATGGCA AGGATGTTTA TACGACTATT TCCAGCACCC 3 00 

TTCAGTCCTT CATGGAGACC CAGATGAATG CCTTTCAAGA AAAAGTAAAA 3 50 

5 GGCAAGTATA TGACGGCTAC CTTGGTCAGT GCTAAAACAG GGGAAATTCT 4 00 

TGCAACAACG CAACGGCCGA CCTTCGATGC TGATACTAAG GAAGGACTTA 4 50 

CCAAGGACTT TGTTTGGCGT GATATCCTCT ATCAAAGTAA CTATGAGCCA 500 

GGGTCAACCA TGAAGGTCAT GACGCTTGCT GCTGCTATTG ATAATAACAC 550 

TTTCCCAGGA GGAGAAGTTT TCAATAGTAG TGAATTAAAA ATAGCGGATG 600 

10 CGACAATTCG AGATTGGGAT GTTAATGATG GTTTGACGAC TGGTGGGATG 650 

ATGACTTTCT TACAAGGTTT CGCTCACTCC AGTAATGTTG GAATGAGTCT 700 

ACTTGAACAA AAAATGGGAG ATGCTACTTG GTTGGATTAT CTAAACCGCT 750 

TTAAGTTTGG GGTGCCGACG CGTTTTGGTC TGACTGATGA GTATTCAGGT 800 

CAATTGCCTG CAGATAATAT TGTTAATATT GCCATGAGTG CATTTGGTCA 850 

15 GGGGATTTCA GTGACCCAGA CGCAAATGAT TCGTGCCTTT ACAGCTATTG 900 

CTAATGATGG TGTTATGCTG GAGCCTAAAT TTATTAGTGC CATTTATGAT 950 

CCAAATGATC AAACTGCTCG GAAATCTCAA AAAGAAATTG TGGGAAATCC 1000 

TGTTTCTAAA GATGCAGCTA GTCTAACTCG GACTAACATG ATTTTGGTAG 1050 

GGACGGATCC GGTTTATGGA ACCATGTATA ACCACAGCAC AGGTAAGCCA 1100 

20 ACTGTAACTG TTCCTGGGCA AAATGTAGCC CTCAAGTCTG GTACGGCTCA 1150 

GATTGCTGAC GAGAAAAATG GTGGTTATCT AGTCGGGTTA ACCAACTATA 12 00 

TTTTCTCGGC TGTATCGATG AATCCGGCTG AAAATCCTGA TTTTATCTTG 12 50 

TATGTAACGG TACAGCAACC TGAACATTAT TCAGGTATCC AGTTGGGAGA 13 00 

ATTTGCCACC CCAATCTTGG AGCGGGCTTC AGCTATGAAA GAATCTCTCA 13 50 

25 ATCTTCAATC TCCAGCCAAG AATTTAGATA AAGTTACGAC AGAATCTTCT 14 00 

TATGCAATGC CTAGCATCAA GGATATTTCA CCTGGTGAGT TGGCGGAAGC 14 50 

CTTACGCCGA AATATTGTGC AACCAATCGT TGTAGGTACT GGAACAAAGA 1500 

TTAAAGAGAC TTCTGTAGAA GAAGGGACCA ATCTTGCACC AAACCAACAA 1550 

GTTCTCCTTT TATCGGATAA GGTAGAAGAA ATTCCAGACA TGTATGGCTG 16 0 0 

30 GAAAAAAGAG ACTGCTGAAA CCTTTGCTAA ATGGTTGGAT ATTGAGCTGG 16 50 

AATTTGAAGG GTCAGGTT 1668 



35 2) INFORMATION FOR SEQ ID NO: 1045 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-12 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1045 



TTGGAAACTG CAGAGGTCAA GGGGATTGAT TTTACAACCA GTCCCAATCG 50 

TAGTTACCCA AATGGACAAT TTGCTTCTAG TTTTATCGGT CTAGCTCAGC 100 

TCCATGAAAA TGAAGATGGA AGCAAGAGTT TGCTGGGAAC CTCTGGAATG 150 
GAGAGTTCCT TGAACAGTAT TCTTGCAGGG ACAGACGGCA TTATTACCTA • 2 00 

55 TGAAAAGGAT CGTCTGGGCA ATATTGTACC CGGAACAGAA CAAGTTTCCC 250 

AACAAACGGT GGATGGCAAG GATGTTTACA CAACCATTTC CAGCCCCCTC 300 

CAGTCCTTCA TGGAAACTCA GATGGATGCC TTTCTAGAAA AAGTAAAAGG 350 

TAAGTATATG ACCGCGACCT TGGTCAGTGC AAAGACCGGT GAAATCCTCG 4 00 

CTACCACCCA ACGACCTACC TTTAATGCAG ATACTAAAGA AGGAATCACT 450 

60 GAGGACTTTG TTTGGCGTGA TATTCTTTAT CAAAGTAACT ATGAACCAGG 500 
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ATCAGCCATG AAGGTTATGA CGTTAGCTTC TTCTATTGAT AATAATACCT 550 

TCCCAAGTGG AGAATACTTC AATAGCAGTG AATTCAAAAT AGCGGATGCG 6 00 

ACGACTCGAG ATTGGGATGT TAATGATGGT TTGACTACTG GTGGGATGAT 650 

GACTTTCTTA CAAGGTTTCG CTCACTCCAG TAATGTTGGA ATGAGTCTAC 700 

5 TTGAACAAAA AATGGGAGAT GCTACTTGGT TGGATTATCT AAAACGCTTT 750 

AAATTTGGGG TTCCAACTCG CTTTGGCTTG ACAGATGAAT ACGCTGGTCA 800 

ACTTCCAGCT GATAATATTG TTAGTATTGC TCAAAGCTCA TTTGGGCAAG 8 50 

GAATTTCAGT GACACAAACA CAAATGCTTC GTGCCTTTAC AGCTATTGCT 900 

AATGATGGAG TTATGCTGGA GCCAAAATTT ATAAGTGCTA TTTATGATAC 950 

10 TAACAATCAG TCTGTACGTA AGTCACAAAA AGAAATAGTA GGAAATCCTG 10 00 

TTTCCAAAGA GGCAGCAAGC ACAACTCGAA ATCACATGAT CTTAGTTGGG 1050 

ACGGACCCTC TATATGGAAC TATGTATAAT CACTACACAG GAAAGCCAAT 1100 

TATAACAGTT CCTGGACAAA ATGTAGCAGT TAAATCCGGT GCGGCTCAAA 1150 

TCGCTGATGA GAAAAATGGA GGATACTTGG TTGGTTCTAC CAATTATATT 12 00 

15 TTCTCAGTTG TGACTATGAA TCCTGCTGAA AATCCTGATT TTATCTTGTA 1250 

TGTAACGGTT CAACAGCCTG AGCATTATTC AGGTATCCAG TTGGGAGAAT 13 00 

TTGCCACCCC AATCTTGGAG CGGGCTTCAG CTATGAAAGA ATCTCTCAAT 13 50 

CTTCAATCTC CAGCCAAAAA TTTAGATAAA GTTACGACAG AATCTTCTTA 14 00 

TGCAATGCCT AGCATCAAGG ATATTTCACC TGGTGAGTTG GCGGAAGCCT 14 50 

2 0 TACGCCGAAA TATTGTGCAA CCAATCGTTG TAGGTACTGG AACAAAGATT 1500 

AAAGAGACTT CTGTAGAAGA AGGGACCAAT CTTGCACCAA ACCAACAAGT 1550 

TCTCCTTTTA TCGGATAAGG TAGAAGAAAT TCCAGACATG TATGGCTGGA 1600 

AAAAAGAGAC TGCCGAGACC TTTGCTAAAT GGTTGGATAT TGAACTGGAA 1650 

TTTGAAGGTT CAGGTTCCGT TGTTCAGAAG 168 0 

25 



2) INFORMATION FOR SEQ ID NO: 1046 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1689 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
40 (B) STRAIN: StrR-13 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1046 

AAAAAAGAGT TGGAAACTGC AGAGGTCAAG GGGATTGATT TTACAACCAG 50 

45 TCCTAATCGT AGTTACCCAA ACGGACAATT TGCTTCTAGT TTTATCGGTC 100 

TAGCTCAACT CCATGAAAAT GAAGATGGAA GCAAGAGTTT GCTGGGAACT 150 

TCTGGAATGG AGAGTTCCTT GAACAGTATT CTTGCAGGGA AAGACGGTAT 2 00 

TATTACTTAT GAAAAGGATC GTCTGGGTAA TATTGTCCCT GGAACAGAAC 2 50 

AAGTTTCCCA ACAAACGGTA GATGGCAAGG ATGTTTATAC AACATTGTCT 300 

50 AGTCCGCTCC AATCTTTCAT GGAAACTCAG ATGGATGCCT TTCTAGAAAA 3 50 

AGTAAAAGGT AAGTATATGA CCGCGACCTT GGTCAGTGCA AAGACCGGTG 4 00 

AAATCCTCGC TACCACCCAA CGACCTACCT TTAATGCAGA TACTAAAGAA 4 50 

GGAATCACTG AGGACTTTGT TTGGCGTGAT ATTCTTTATC AAAGTAACTA 500 

TGAACCAGGA TCAGCCATGA AGGTTATGAC GTTAGCTTCT TCTATTGATA 550 

55 ATAATACCTT CCCAAGTGGA GAATACTTCA ATAGCAGTGA ATTCAAAATA 600 

GCGGATGCGA CGACTCGAGA TTGGGATGTT AATGATGGTT TGACTACTGG 650 

TGGGATGATG ACTTTCTTAC AAGGTTTCGC TCACTCCAGT AATGTTGGAA 700 

TGAGTCTACT TGAACAAAAA ATGGGAGATG CTACTTGGTT GGATTATCTA 750 

AAACGCTTTA AATTTGGGGT TCCAACTCGC TTTGGCTTGA CAGATGAATA 800 

60 CGCTGGTCAA CTTCCAGCTG ATAATATTGT TAGTATTGCT CAAAGCTCAT 850 

590 
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